[gdi-userportal-dataset-discovery-service] Remove CKAN max rows limitation #104

admy7 · 2024-08-27T18:07:47Z

🎯 What? (Story Description)

Find a way to retrieve all the records from package_search
Apply it in CkanDatasetsIdCollector to retrieve all the dataset Ids from CKAN

💡 Why? (Justification)

In this service, we provide to the user the possibility to search datasets, eventually using different sources (e.g. CKAN, Beacon).

Internally, the first step is to find the ids of the datasets which match the user query, for each data source separately.
Then, we reconcile the different id sets by taking the intersection of those.
Consequently, the final intersection can be smaller than what the initial query was looking for.

In this regard, we would like to retrieve ALL the dataset ids for every data source, before the merge, to reduce as much as possible the chances of getting less records than inquired.
CKAN is problematic because it limits us to 1000 records maximum (by default).

🔨 Tasks (Breakdown)

Find a way to retrieve all the records from package_search
Apply it in CkanDatasetsIdCollector to retrieve all the dataset ids from CKAN

✅ Acceptance Criteria

Can we retrieve all the dataset ids from CKAN?

➕ Additional Information

No response

The text was updated successfully, but these errors were encountered:

admy7 added the user story label Aug 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[gdi-userportal-dataset-discovery-service] Remove CKAN max rows limitation #104

[gdi-userportal-dataset-discovery-service] Remove CKAN max rows limitation #104

admy7 commented Aug 27, 2024 •

edited

Loading

[gdi-userportal-dataset-discovery-service] Remove CKAN max rows limitation #104

[gdi-userportal-dataset-discovery-service] Remove CKAN max rows limitation #104

Comments

admy7 commented Aug 27, 2024 • edited Loading

🎯 What? (Story Description)

💡 Why? (Justification)

🔨 Tasks (Breakdown)

✅ Acceptance Criteria

➕ Additional Information

admy7 commented Aug 27, 2024 •

edited

Loading