Implement crawler feature #14

Open
tjworks opened this issue Nov 30, 2016 · 1 comment

Comments

@tjworks
Contributor

tjworks commented Nov 30, 2016

We could collect MongoDB-related content:

  • Blogs (e.g. CSDN?)
  • Q&A (Google Groups, Stack Overflow)
  • Articles
@zhangyaoxing
Collaborator

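We may need to pin down the scope first. Is our crawler a vertical search that only cares about specific content and sites? For example, only questions tagged mongodb on Stack Overflow, and only discussions on CSDN that contain the keyword mongodb?
Or do we crawl by following links, fetching everything whether it is a blog post, a question, or an article, and only filter at the end, for instance keeping content in which the keyword mongodb appears?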
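
To make the two options concrete, here is a rough sketch, not a proposed implementation. Everything in it is an assumption for illustration: the names `crawl`, `extract_links`, `mentions_keyword`, the `fetch` callback, and the `allowed_hosts` parameter are hypothetical, and a host allow-list plus a keyword check is only an approximation of "vertical search" (it does not use site-specific tags). With `allowed_hosts=None` it behaves like the second option: follow every link and filter by keyword afterwards.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

KEYWORD = "mongodb"  # assumed filter keyword, taken from the discussion


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(base_url, html):
    """Return absolute URLs for all links found in html."""
    parser = LinkExtractor()
    parser.feed(html)
    return [urljoin(base_url, href) for href in parser.links]


def mentions_keyword(html, keyword=KEYWORD):
    """Case-insensitive substring check; a real filter would likely be stricter."""
    return keyword.lower() in html.lower()


def crawl(seeds, fetch, allowed_hosts=None, max_pages=100):
    """Breadth-first crawl starting from seeds.

    fetch(url) is a caller-supplied function returning HTML text or None.
    allowed_hosts=None  -> follow links anywhere, filter by keyword (option 2).
    allowed_hosts={...} -> only follow links on those hosts (rough option 1).
    Returns the URLs of fetched pages that mention the keyword.
    """
    queue = deque(seeds)
    seen = set(seeds)
    matches = []
    fetched = 0
    while queue and fetched < max_pages:
        url = queue.popleft()
        html = fetch(url)
        fetched += 1
        if html is None:
            continue
        if mentions_keyword(html):
            matches.append(url)
        for link in extract_links(url, html):
            if link in seen:
                continue
            if allowed_hosts is not None and urlparse(link).hostname not in allowed_hosts:
                continue
            seen.add(link)
            queue.append(link)
    return matches
```

Passing `fetch` in as a parameter keeps the sketch independent of any HTTP library and lets it run against an in-memory dict of pages. A real crawler would also need rate limiting and robots.txt handling, which are left out here.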


2 participants