Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Streamin Data Sources #5

Open
tarikaltuncu opened this issue Sep 26, 2020 · 3 comments
Open

Streamin Data Sources #5

tarikaltuncu opened this issue Sep 26, 2020 · 3 comments

Comments

@tarikaltuncu
Copy link

Hi, is D-ETM good for streamin data sources? How do you suggest training in mini-batches of data as it accumulates?

I also have another related question. I understand that DETM can adjust the model for concept drifts by shifting the mean and varience of topics by gaussian distribution. What if there is a particular topic that exists only (and heavily) in only one (or a few) particular time interval. Would DETM be able to detect it? Or is there an assumption that all topics exist at all time intervals?

@Emekaborisama
Copy link

No, from my experience, all the topics don't exist at all time.

Yes, there will surely be a topic that occurs only one in a particular time interval.

@Emekaborisama
Copy link

When you say streaming data source, do you mean realtime data from google sheet or SQL?

@mona-timmermann
Copy link

You might want to have a look at the paper on Dynamic Mixture Models by Wei et al. (2007) and similar papers on topic modeling for streaming data.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants