sklearn.model_selection.train_test_split出现Python错误:ValueError:找到输入数据的样本数不一致:[416858，398427]

本文介绍了sklearn.model_selection.train_test_split出现Python错误:ValueError:找到输入数据的样本数不一致:[416858，398427]的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我的标签数量与样本数量不匹配，因此我认为解决方案是删除一些样本数据，但总体而言，这不是一个好习惯.

My number of labels doesn't match the number of samples, so I think a solution would be to remove some of the sample data, but I think that's not a good practice overall.

这是我的代码:

X = np.loadtxt('/Users/myname/PycharmProjects/my_project/X.txt')
y = np.loadtxt('/Users/myname/PycharmProjects/my_project/y.txt')

print np.shape(X)
print np.shape(y)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=.3)

我得到了错误:

ValueError: Found input variables with inconsistent numbers of samples: [416858, 398427]

任何人都可以解释一下我需要做些什么来解决它吗?

Can anyone explain what I would need to do to fix it?

the

sklearn.model_selection.train_test_split出现Python错误:ValueError:找到输入数据的样本数不一致:[416858，398427]

问题描述

推荐答案