You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Find a way to retrieve all the records from package_search
Apply it in CkanDatasetsIdCollector to retrieve all the dataset Ids from CKAN
💡 Why? (Justification)
In this service, we provide to the user the possibility to search datasets, eventually using different sources (e.g. CKAN, Beacon).
Internally, the first step is to find the ids of the datasets which match the user query, for each data source separately.
Then, we reconcile the different id sets by taking the intersection of those.
Consequently, the final intersection can be smaller than what the initial query was looking for.
In this regard, we would like to retrieve ALL the dataset ids for every data source, before the merge, to reduce as much as possible the chances of getting less records than inquired.
CKAN is problematic because it limits us to 1000 records maximum (by default).
🔨 Tasks (Breakdown)
Find a way to retrieve all the records from package_search
Apply it in CkanDatasetsIdCollector to retrieve all the dataset ids from CKAN
✅ Acceptance Criteria
Can we retrieve all the dataset ids from CKAN?
➕ Additional Information
No response
The text was updated successfully, but these errors were encountered:
🎯 What? (Story Description)
Find a way to retrieve all the records from
package_search
Apply it in
CkanDatasetsIdCollector
to retrieve all the dataset Ids from CKAN💡 Why? (Justification)
In this service, we provide to the user the possibility to search datasets, eventually using different sources (e.g. CKAN, Beacon).
Internally, the first step is to find the ids of the datasets which match the user query, for each data source separately.
Then, we reconcile the different id sets by taking the intersection of those.
Consequently, the final intersection can be smaller than what the initial query was looking for.
In this regard, we would like to retrieve ALL the dataset ids for every data source, before the merge, to reduce as much as possible the chances of getting less records than inquired.
CKAN is problematic because it limits us to 1000 records maximum (by default).
🔨 Tasks (Breakdown)
Find a way to retrieve all the records from
package_search
Apply it in
CkanDatasetsIdCollector
to retrieve all the dataset ids from CKAN✅ Acceptance Criteria
Can we retrieve all the dataset ids from CKAN?
➕ Additional Information
No response
The text was updated successfully, but these errors were encountered: