Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

which parameter to change to run demo for n epochs? #10

Open
muthiyanbhushan opened this issue Mar 6, 2018 · 2 comments
Open

which parameter to change to run demo for n epochs? #10

muthiyanbhushan opened this issue Mar 6, 2018 · 2 comments

Comments

@muthiyanbhushan
Copy link

muthiyanbhushan commented Mar 6, 2018

Hello,

I had gone through all of the code. I could not get how to change the #epochs or #iter parameter and also #threads parameter like in Google-C code.

Can you give little glimpse on it.

I found n_workers parameter but by changing that parameter I could not get much more timing difference. Changing n_workers from 4 to 16 only improved 5 sec of training time.

Please let me know.

Thanks.

@muthiyanbhushan muthiyanbhushan changed the title which parameter to change to run the demo for n epochs? which parameter to change to run demo for n epochs? Mar 6, 2018
@jdeng
Copy link
Owner

jdeng commented Mar 7, 2018

IIRC the code relies on OpenMP for parallel training so the n_workers param is probably not useful. If you compile with OpenMP support you should be able to use all the CPUs.
You can take a look at the train() function to add extra loops to train more iterations.

@muthiyanbhushan
Copy link
Author

muthiyanbhushan commented Mar 8, 2018

Hello @jdeng,
Thanks I changed the code for more number of iterations by adding a for loop. Now there are 3 nested for loops in the train function.
I have few questions:

  1. Does one iteration means training over whole 17 Million words in text8 corpus?
  2. Also, wanted to know what batch size is being used in the code.
  3. Was your original github code training for just 1 iterations ?
  4. Does the concept #iteraions in original word2vec code same as this one?

https://github.com/svn2github/word2vec

Please, let me know.

Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants