CLI utility that facilitates the recopilation of a corpus based on Stack Exchange sites. Sentences with their corresponding labels are stored in 'csv' files.
git clone https://github.com/searleser97/stackexchange-corpus-builder
cd stackexchange-corpus-builder
npm run install:dev
Run:
scb
or
stackexchange-corpus-builder
Note: When logging into https://data.stackexchange.com avoid using "google" since it detects that your browser has automation enabled, the preferred method is to use "login with stackoverflow" and use the traditional "email" and "password" fields.