Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

APS: duplicated affids removal #116

Merged
merged 1 commit into from
Dec 7, 2023
Merged

Conversation

ErnestaP
Copy link
Contributor

@ErnestaP ErnestaP commented Dec 6, 2023

No description provided.

@@ -13,9 +13,11 @@

from scrapy.http import TextResponse
from hepcrawl.spiders import aps_spider
from hepcrawl.parsers import aps_apr
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what is this? 🤔

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A mistake, removed!

@ErnestaP ErnestaP force-pushed the APS-duplicate branch 2 times, most recently from 8ebb7fa to 2005e8b Compare December 7, 2023 10:34
'surname': u'Sethna'}]

sorted_expected_results = sorted(
expected_results, key=lambda x: x['affiliations'][0]['value'] if x['affiliations'] else '')
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@ErnestaP why do we have to sort the ‘expected_results’? Lists are maintaining the order, that means we just need to put the data in the correct order.

@ErnestaP ErnestaP force-pushed the APS-duplicate branch 2 times, most recently from 1a3c401 to fb91246 Compare December 7, 2023 12:05
@@ -64,6 +64,7 @@
{
"surname":"Alemi",
"affiliationIds":[
"a1",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So, at the end we are not testing this anywhere, right? So no need to change it

Copy link
Contributor Author

@ErnestaP ErnestaP Dec 7, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we are testing it, we see that parsed author does not have repetitive affiliations

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

but where do we assert it?

@drjova drjova merged commit f9118d7 into SCOAP3:master Dec 7, 2023
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants