V2 #18

Merged
merged 134 commits into from
Jan 10, 2023


breuleux
Member

The version of Paperoni in master hasn't been usable in a while, so we might as well replace it with the v2 branch right now.

notoraptor and others added 30 commits October 20, 2021 13:42
* Add a query manager for Semantic Scholar.
* Add command `semantic-scholar <keywords>` to search by keywords using Semantic Scholar API.
* Add first version of SQL database schema and a simple test.
* Add the command `paperoni import -j <input.json> -c <database.db> [-a <author name>] [-v]`
* Link authors to their MAG ids in the DB.
* Update dependencies
* Move semantic_scholar to sources/
* Support rate limiting
* Use query results for generators, auto-chain requests
* Adjust table names
* Remove useless verbose print in command_import, as tqdm already displays useful information.

* Save MAG ID into table paper_link when importing papers from JSON to SQL.

* Collect papers from Semantic Scholar.

* Add boolean column `paper.excluded` to manage excluded papers.
* Check paper IDs to find a paper in SQL database.
* Check paper and author MAG ID when importing from JSON to SQL.
* Search for author using available IDs.
* Move SQL Collection object into a specific file and add new methods get_paper() and query().
* Return nothing if no parameter given.
* Save citation count when importing from JSON (Microsoft Academic Research) to SQL.
* Set parameters start and end to None by default.
* Fix help message for parameter limit.
* Print venue correctly in papers2.Paper.format_term().
* Fix date limit where year is specified.
* Add unit tests for each SQL search option.

[import]
* Make sure to not keep empty affiliation in paper_author table if available affiliation is found.
* Do not keep empty affiliation in old Author class.
Co-authored-by: Olivier Breuleux <[email protected]>
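
For readers skimming the commit list, here is a rough sketch of how an `excluded` flag on `paper` and external IDs in `paper_link` could fit together when looking up a paper by its MAG ID. Only the names `paper`, `paper.excluded` and `paper_link` come from the commit messages above. The other columns (`paper_id`, `title`, `link_type`, `link`) and the helper `find_paper_by_link` are assumptions made for illustration and may not match the actual v2 schema.

```python
import sqlite3

# Hypothetical schema: only `paper`, `paper.excluded` and `paper_link` are named
# in the commit messages; every other column here is an assumption.
SCHEMA = """
CREATE TABLE paper (
    paper_id INTEGER PRIMARY KEY,
    title TEXT NOT NULL,
    -- SQLite has no boolean type, so the flag is stored as 0 or 1.
    excluded INTEGER NOT NULL DEFAULT 0 CHECK (excluded IN (0, 1))
);
CREATE TABLE paper_link (
    paper_id INTEGER NOT NULL REFERENCES paper(paper_id),
    link_type TEXT NOT NULL,
    link TEXT NOT NULL,
    PRIMARY KEY (paper_id, link_type, link)
);
"""


def find_paper_by_link(conn, link_type, link):
    """Return (paper_id, title) for a non-excluded paper with this external ID, or None."""
    return conn.execute(
        "SELECT p.paper_id, p.title FROM paper AS p "
        "JOIN paper_link AS l ON l.paper_id = p.paper_id "
        "WHERE l.link_type = ? AND l.link = ? AND p.excluded = 0",
        (link_type, link),
    ).fetchone()


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
conn.execute("INSERT INTO paper (paper_id, title) VALUES (1, 'Example paper')")
conn.execute("INSERT INTO paper_link VALUES (1, 'mag', '12345')")

# The paper is visible until it is marked as excluded.
found = find_paper_by_link(conn, "mag", "12345")
conn.execute("UPDATE paper SET excluded = 1 WHERE paper_id = 1")
hidden = find_paper_by_link(conn, "mag", "12345")
```

Keeping the flag on the `paper` row means an excluded paper stays in the database with its links intact, so a later import can still recognise it by ID instead of re-adding it.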
@breuleux breuleux merged commit 07a6136 into master Jan 10, 2023

3 participants