Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

为啥不random_shuffle呢?训练集验证集测试集也没有分 #3

Open
bright1993ff66 opened this issue May 22, 2018 · 4 comments

Comments

@bright1993ff66
Copy link

生成完毕PCA_X以后应该shuffle一下吧,直接用SVM跑不太好吧
而且训练集验证集测试集也没有分,hyperparameter也需要tune一下吧

@CxwDelete
Copy link

问一下,第四个文件4_getwordvecs.py,加载模型怎么回事,Wiki.zh.text.vector 是什么文件

@bright1993ff66
Copy link
Author

Wiki.zh.text.vector是之前基于wiki中文训练出来的pre-trained word embedding啊
getwordvecs.py就是拿到那个获得词向量啊

@wanghuahua2019
Copy link

我也有疑问,它训练和后面测试的不是相同的数据吗?这样准确率肯定很高啊

@yang1637653089
Copy link

Wiki.zh.text.vector是之前基于wiki中文训练出来的pre-trained word embedding啊 getwordvecs.py就是拿到那个获得词向量啊

分享一下宝
Wiki.zh.text.vector

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants