Correction in "Testing and Training Sets" section by fratzola · Pull Request #1 · Harvard-IACS/computefest2017-pythonml

fratzola · 2017-01-15T00:04:50Z

Correction in "Testing and Training Sets" section, in:
df=pd.DataFrame(dict(x=x[indexes],f=f[indexes],y=y[indexes]))

from sklearn.cross_validation import train_test_split
datasize=df.shape[0]
#split dataset using the index, as we have x,f, and y that we want to split.
itrain,itest = train_test_split(range(30),train_size=24, test_size=6)
xtrain= df.x[indexes[itrain]].values
ftrain = df.f[indexes[itrain]].values
ytrain = df.y[indexes[itrain]].values
xtest= df.x[indexes[itest]].values
ftest = df.f[indexes[itest]].values
ytest = df.y[indexes[itest]].values

Dict creates different indexing so in order for the itrain and itest indices to be correct they have to pass through 'indexes'!!

otherwise there's a lot of Nan values that should be there.

Correction in "Testing and Training Sets" section, in: df=pd.DataFrame(dict(x=x[indexes],f=f[indexes],y=y[indexes])) from sklearn.cross_validation import train_test_split datasize=df.shape[0] #split dataset using the index, as we have x,f, and y that we want to split. itrain,itest = train_test_split(range(30),train_size=24, test_size=6) xtrain= df.x[indexes[itrain]].values ftrain = df.f[indexes[itrain]].values ytrain = df.y[indexes[itrain]].values xtest= df.x[indexes[itest]].values ftest = df.f[indexes[itest]].values ytest = df.y[indexes[itest]].values # Dict creates different indexing so in order for the itrain and itest indices to be correct they have to pass through 'indexes'!! # otherwise there's a lot of Nan values that should be there.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Correction in "Testing and Training Sets" section#1

Correction in "Testing and Training Sets" section#1
fratzola wants to merge 1 commit intoHarvard-IACS:masterfrom
fratzola:patch-1

fratzola commented Jan 15, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

fratzola commented Jan 15, 2017

Dict creates different indexing so in order for the itrain and itest indices to be correct they have to pass through 'indexes'!!

otherwise there's a lot of Nan values that should be there.

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant