Get an overview of the data available via the API. This will be documented here:
dataset data openml.datasets.list_datasets(output_format="dataframe")
did: unique dataset ID
name: not unique
version: int; the combination of name and version seems to be unique in every case but one
uploader: int (possibly a user ID?)
status: "active" for all of them
format: one of ARFF, Sparse_ARFF, arff or sparse_arff
MajorityClassSize: number or NaN
MaxNominalAttDistinctValues: number or NaN
MinorityClassSize: number or NaN
NumberOfClasses: number or NaN
NumberOfFeatures: number or NaN
NumberOfInstances: number or NaN
NumberOfInstancesWithMissingValues: number or NaN
NumberOfMissingValues: number or NaN
NumberOfNumericFeatures: number or NaN
NumberOfSymbolicFeatures: number or NaN
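The observation that `name` and `version` are unique together in all but one case, and that `format` appears in four spellings, could be checked roughly as follows. This is a minimal sketch using only the standard library. The rows are made-up placeholders, not real OpenML records, and `find_duplicate_keys` and `normalized_formats` are hypothetical helpers. With the real listing, the same counting could be applied to the `name`, `version` and `format` columns of the dataframe.

```python
from collections import Counter

# Hypothetical rows standing in for the dataset listing (not real OpenML data).
rows = [
    {"did": 1, "name": "alpha", "version": 1, "format": "ARFF"},
    {"did": 2, "name": "alpha", "version": 2, "format": "arff"},
    {"did": 3, "name": "beta", "version": 1, "format": "Sparse_ARFF"},
    {"did": 4, "name": "beta", "version": 1, "format": "sparse_arff"},
]


def find_duplicate_keys(rows):
    """Return the (name, version) pairs that occur more than once."""
    counts = Counter((row["name"], row["version"]) for row in rows)
    return [key for key, n in counts.items() if n > 1]


def normalized_formats(rows):
    """Collapse the different spellings of the format into lowercase values."""
    return sorted({row["format"].lower() for row in rows})


duplicates = find_duplicate_keys(rows)
formats = normalized_formats(rows)
```

Lowercasing the format is only one possible normalization; whether the upper- and lowercase spellings really mean the same thing is an assumption here.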
evaluations (an evaluation function has to be given)
run_id: run id
task_id: task id
setup_id: setup id
flow_id: flow id
flow_name: flow name
data_id: dataset id?
data_name: dataset name?
function: evaluation function
upload_time: time it was uploaded
uploader: uploader number
uploader_name: name string
value: int
values: always None?
array_data: always None?
flows
id: unique id
full_name: name with number in parentheses
name: name of Python class or function
version: number
external_version: None or package versions with package name in the form 'openml==0.14.1,sklearn==1.3.0'
uploader: number
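Since `external_version` packs several package versions into one string, it may be handy to split it into a mapping. A minimal sketch, assuming the format is always comma-separated `package==version` entries as in the example above; `parse_external_version` is a hypothetical helper, not part of the openml package.

```python
def parse_external_version(external_version):
    """Split 'pkg==1.0,other==2.0' into {'pkg': '1.0', 'other': '2.0'}."""
    if external_version is None:
        return {}
    packages = {}
    for entry in external_version.split(","):
        entry = entry.strip()
        if not entry:
            continue
        name, sep, version = entry.partition("==")
        # Assumption: entries without '==' are kept with version None.
        packages[name] = version if sep else None
    return packages


parsed = parse_external_version("openml==0.14.1,sklearn==1.3.0")
```

In a dataframe the missing case might show up as NaN rather than None, which this sketch does not handle.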
runs
run_id: unique id
task_id: task id
setup_id: setup id
flow_id: flow id
uploader: number
task_type: instance of task type in the following form: TaskType.LEARNING_CURVE
upload_time: time in the format of 2014-04-06 23:30:40
error_message: string
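If `upload_time` comes back as a string in the format shown above, it can be turned into a `datetime` object for sorting or filtering. A minimal sketch; `UPLOAD_TIME_FORMAT` and `parse_upload_time` are hypothetical names, and it is an assumption that every row uses exactly this format.

```python
from datetime import datetime

# Matches strings such as '2014-04-06 23:30:40'.
UPLOAD_TIME_FORMAT = "%Y-%m-%d %H:%M:%S"


def parse_upload_time(text):
    """Parse an upload_time string into a naive datetime (no timezone)."""
    return datetime.strptime(text, UPLOAD_TIME_FORMAT)


uploaded = parse_upload_time("2014-04-06 23:30:40")
```

The result carries no timezone information, since the listing does not say which timezone the timestamps are in.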
setups
setup_id: unique id
flow_id: flow id
parameters: dict whose entries are given as numbers; each entry contains information such as flow information, data_type, default_value, etc.
study openml.study.list_studies(output_format="dataframe") (it is a bit unclear what this is; only two are listed, although the IDs suggest that there are more)
id: unique id, only 123 and 226
main_entity_type: "run"
status: "active"
creation_date: time in the format of 2019-02-21 19:55:30
openml.tasks.list_tasks(output_format="dataframe")
Dependencies: Task on Dataset; Run on Task, Setup and Flow; Setup on Flow, Evaluation on Run, Task, Setup, Flow, Dataset
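The dependencies listed above can be written down as a small graph, which also gives an order in which the entities could be fetched or stored so that everything a record refers to already exists. A minimal sketch using `graphlib` from the standard library (Python 3.9+); the lowercase entity names are just labels chosen here.

```python
from graphlib import TopologicalSorter

# Each entity maps to the set of entities it depends on, as listed above.
dependencies = {
    "dataset": set(),
    "flow": set(),
    "task": {"dataset"},
    "setup": {"flow"},
    "run": {"task", "setup", "flow"},
    "evaluation": {"run", "task", "setup", "flow", "dataset"},
}

# static_order() yields every entity after all of its dependencies.
load_order = list(TopologicalSorter(dependencies).static_order())
```

Several orders are valid (for example, dataset and flow can come in either order), but evaluation always comes last because it depends on everything else.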