Update wheel2.json
Franri3008 committed Dec 18, 2024
1 parent 0c2544b commit bf1a375
Showing 1 changed file with 13 additions and 2 deletions.
15 changes: 13 additions & 2 deletions pages/Wheels/wheel2.json
@@ -1,7 +1,7 @@
{
"central": {
-"name": "OPEN SOURCE",
-"subname": "AI"
+"name": "Hugging Face",
+"subname": "Datasets"
},
"items": [
{
@@ -25,6 +25,17 @@
"color": "#5086BC",
"x": 0.0,
"y": -0.05
+},
+{
+"name": "arXiver ",
+"subname": "Neuralwork",
+"bullets": ["Curated for question-answering tasks", "Data is converted to highly readable (.mmd) format"],
+"description": "The largest open and permissible licensed text dataset, comprising over 2 trillion tokens (2,003,039,184,047 tokens). Contains a diverse set of sources such as books, newspapers, scientific articles, government and legal documents, code, and more.\nEstimated number of rows: 396,953,971\nSize of auto-converted Parquet files (First 5GB): 2.96 GB\nKey feature: Data is permissively licensed, meaning it can be used, modified, and redistributed without legal ambiguity or risk of infringement.",
+"icon": "https://drive.google.com/file/d/10OJ348DMNH3nd8yJBqyP44unwQPL-x9a/view?usp=drive_link",
+"url": "https://huggingface.co/datasets/PleIAs/common_corpus",
+"color": "#2FAD3B",
+"x": 0.0,
+"y": -0.05
}
]
}
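
A wheel file of this shape could be sanity-checked before committing. The following is a minimal sketch, not part of the repository: the function name `check_wheel` and the set of required keys are assumptions, inferred only from the single item visible in this diff. It reports missing keys and names with stray whitespace.

```python
# Hypothetical validator for a wheel config; the required keys are an
# assumption taken from the one item shown in the diff above.
REQUIRED_ITEM_KEYS = {
    "name", "subname", "bullets", "description",
    "icon", "url", "color", "x", "y",
}


def check_wheel(config: dict) -> list[str]:
    """Return a list of human-readable problems found in a wheel config."""
    problems = []

    # The central node is expected to carry a string name and subname.
    central = config.get("central", {})
    for key in ("name", "subname"):
        if not isinstance(central.get(key), str):
            problems.append(f"central.{key} is missing or not a string")

    for index, item in enumerate(config.get("items", [])):
        # Report any expected key that the item does not define.
        missing = REQUIRED_ITEM_KEYS - set(item)
        if missing:
            problems.append(f"items[{index}] is missing keys: {sorted(missing)}")

        # Leading or trailing whitespace in a name is usually a typo.
        name = item.get("name", "")
        if isinstance(name, str) and name != name.strip():
            problems.append(
                f"items[{index}].name has surrounding whitespace: {name!r}"
            )

    return problems


# A reduced config holding only the item added in this commit
# (description shortened to a placeholder).
sample = {
    "central": {"name": "Hugging Face", "subname": "Datasets"},
    "items": [
        {
            "name": "arXiver ",
            "subname": "Neuralwork",
            "bullets": ["Curated for question-answering tasks"],
            "description": "(description text)",
            "icon": "https://drive.google.com/file/d/10OJ348DMNH3nd8yJBqyP44unwQPL-x9a/view?usp=drive_link",
            "url": "https://huggingface.co/datasets/PleIAs/common_corpus",
            "color": "#2FAD3B",
            "x": 0.0,
            "y": -0.05,
        }
    ],
}

problems = check_wheel(sample)
for problem in problems:
    print(problem)
```

In practice the dictionary would come from `json.load` on the wheel file rather than a literal; the literal here only keeps the sketch self-contained.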