deploy: 613fd34

hezarai · Jan 2, 2024 · 5f2d8f7 · 5f2d8f7
commit 5f2d8f7
Show file tree

Hide file tree

Showing 572 changed files with 170,894 additions and 0 deletions.
diff --git a/.buildinfo b/.buildinfo
@@ -0,0 +1,4 @@
+# Sphinx build info version 1
+# This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done.
+config: 46dac3e782f238b726e25024d3fa6112
+tags: 645f666f9bcd5a90fca523b33c5a78b7
diff --git a/.doctrees/contributing.doctree b/.doctrees/contributing.doctree
diff --git a/.doctrees/environment.pickle b/.doctrees/environment.pickle
diff --git a/.doctrees/get_started/index.doctree b/.doctrees/get_started/index.doctree
diff --git a/.doctrees/get_started/installation.doctree b/.doctrees/get_started/installation.doctree
diff --git a/.doctrees/get_started/overview.doctree b/.doctrees/get_started/overview.doctree
diff --git a/.doctrees/get_started/quick_tour.doctree b/.doctrees/get_started/quick_tour.doctree
diff --git a/.doctrees/guide/advanced_training.doctree b/.doctrees/guide/advanced_training.doctree
diff --git a/.doctrees/guide/hezar_architecture.doctree b/.doctrees/guide/hezar_architecture.doctree
diff --git a/.doctrees/guide/index.doctree b/.doctrees/guide/index.doctree
diff --git a/.doctrees/guide/models_advanced.doctree b/.doctrees/guide/models_advanced.doctree
diff --git a/.doctrees/guide/trainer_in_depth.doctree b/.doctrees/guide/trainer_in_depth.doctree
diff --git a/.doctrees/index.doctree b/.doctrees/index.doctree
diff --git a/.doctrees/source/hezar.builders.doctree b/.doctrees/source/hezar.builders.doctree
diff --git a/.doctrees/source/hezar.configs.doctree b/.doctrees/source/hezar.configs.doctree
diff --git a/.doctrees/source/hezar.constants.doctree b/.doctrees/source/hezar.constants.doctree
diff --git a/.doctrees/source/hezar.data.data_collators.doctree b/.doctrees/source/hezar.data.data_collators.doctree
diff --git a/.doctrees/source/hezar.data.datasets.dataset.doctree b/.doctrees/source/hezar.data.datasets.dataset.doctree
diff --git a/.doctrees/source/hezar.data.datasets.doctree b/.doctrees/source/hezar.data.datasets.doctree
diff --git a/.doctrees/source/hezar.data.datasets.image_captioning_dataset.doctree b/.doctrees/source/hezar.data.datasets.image_captioning_dataset.doctree
diff --git a/.doctrees/source/hezar.data.datasets.ocr_dataset.doctree b/.doctrees/source/hezar.data.datasets.ocr_dataset.doctree
diff --git a/.doctrees/source/hezar.data.datasets.sequence_labeling_dataset.doctree b/.doctrees/source/hezar.data.datasets.sequence_labeling_dataset.doctree
diff --git a/.doctrees/source/hezar.data.datasets.text_classification_dataset.doctree b/.doctrees/source/hezar.data.datasets.text_classification_dataset.doctree
diff --git a/.doctrees/source/hezar.data.datasets.text_summarization_dataset.doctree b/.doctrees/source/hezar.data.datasets.text_summarization_dataset.doctree
diff --git a/.doctrees/source/hezar.data.doctree b/.doctrees/source/hezar.data.doctree
diff --git a/.doctrees/source/hezar.doctree b/.doctrees/source/hezar.doctree
diff --git a/.doctrees/source/hezar.embeddings.doctree b/.doctrees/source/hezar.embeddings.doctree
diff --git a/.doctrees/source/hezar.embeddings.embedding.doctree b/.doctrees/source/hezar.embeddings.embedding.doctree
diff --git a/.doctrees/source/hezar.embeddings.fasttext.doctree b/.doctrees/source/hezar.embeddings.fasttext.doctree
diff --git a/.doctrees/source/hezar.embeddings.word2vec.doctree b/.doctrees/source/hezar.embeddings.word2vec.doctree
diff --git a/.doctrees/source/hezar.metrics.accuracy.doctree b/.doctrees/source/hezar.metrics.accuracy.doctree
diff --git a/.doctrees/source/hezar.metrics.bleu.doctree b/.doctrees/source/hezar.metrics.bleu.doctree
diff --git a/.doctrees/source/hezar.metrics.cer.doctree b/.doctrees/source/hezar.metrics.cer.doctree
diff --git a/.doctrees/source/hezar.metrics.doctree b/.doctrees/source/hezar.metrics.doctree
diff --git a/.doctrees/source/hezar.metrics.f1.doctree b/.doctrees/source/hezar.metrics.f1.doctree
diff --git a/.doctrees/source/hezar.metrics.metric.doctree b/.doctrees/source/hezar.metrics.metric.doctree
diff --git a/.doctrees/source/hezar.metrics.precision.doctree b/.doctrees/source/hezar.metrics.precision.doctree
diff --git a/.doctrees/source/hezar.metrics.recall.doctree b/.doctrees/source/hezar.metrics.recall.doctree
diff --git a/.doctrees/source/hezar.metrics.rouge.doctree b/.doctrees/source/hezar.metrics.rouge.doctree
diff --git a/.doctrees/source/hezar.metrics.seqeval.doctree b/.doctrees/source/hezar.metrics.seqeval.doctree
diff --git a/.doctrees/source/hezar.metrics.wer.doctree b/.doctrees/source/hezar.metrics.wer.doctree
diff --git a/.doctrees/source/hezar.models.backbone.bert.bert.doctree b/.doctrees/source/hezar.models.backbone.bert.bert.doctree
diff --git a/.doctrees/source/hezar.models.backbone.bert.bert_config.doctree b/.doctrees/source/hezar.models.backbone.bert.bert_config.doctree
diff --git a/.doctrees/source/hezar.models.backbone.bert.doctree b/.doctrees/source/hezar.models.backbone.bert.doctree
diff --git a/.doctrees/source/hezar.models.backbone.distilbert.distilbert.doctree b/.doctrees/source/hezar.models.backbone.distilbert.distilbert.doctree
diff --git a/.doctrees/source/hezar.models.backbone.distilbert.distilbert_config.doctree b/.doctrees/source/hezar.models.backbone.distilbert.distilbert_config.doctree
diff --git a/.doctrees/source/hezar.models.backbone.distilbert.doctree b/.doctrees/source/hezar.models.backbone.distilbert.doctree
diff --git a/.doctrees/source/hezar.models.backbone.doctree b/.doctrees/source/hezar.models.backbone.doctree
diff --git a/.doctrees/source/hezar.models.backbone.roberta.doctree b/.doctrees/source/hezar.models.backbone.roberta.doctree
diff --git a/.doctrees/source/hezar.models.backbone.roberta.roberta.doctree b/.doctrees/source/hezar.models.backbone.roberta.roberta.doctree
diff --git a/.doctrees/source/hezar.models.backbone.roberta.roberta_config.doctree b/.doctrees/source/hezar.models.backbone.roberta.roberta_config.doctree
diff --git a/.doctrees/source/hezar.models.backbone.vit.doctree b/.doctrees/source/hezar.models.backbone.vit.doctree
diff --git a/.doctrees/source/hezar.models.backbone.vit.vit.doctree b/.doctrees/source/hezar.models.backbone.vit.vit.doctree
diff --git a/.doctrees/source/hezar.models.backbone.vit.vit_config.doctree b/.doctrees/source/hezar.models.backbone.vit.vit_config.doctree
diff --git a/.doctrees/source/hezar.models.doctree b/.doctrees/source/hezar.models.doctree
diff --git a/.doctrees/source/hezar.models.image2text.beit_roberta.beit_roberta_image2text.doctree b/.doctrees/source/hezar.models.image2text.beit_roberta.beit_roberta_image2text.doctree
diff --git a/.doctrees/source/hezar.models.image2text.beit_roberta.beit_roberta_image2text_config.doctree b/.doctrees/source/hezar.models.image2text.beit_roberta.beit_roberta_image2text_config.doctree
diff --git a/.doctrees/source/hezar.models.image2text.beit_roberta.doctree b/.doctrees/source/hezar.models.image2text.beit_roberta.doctree
diff --git a/.doctrees/source/hezar.models.image2text.crnn.crnn_decode_utils.doctree b/.doctrees/source/hezar.models.image2text.crnn.crnn_decode_utils.doctree
diff --git a/.doctrees/source/hezar.models.image2text.crnn.crnn_image2text.doctree b/.doctrees/source/hezar.models.image2text.crnn.crnn_image2text.doctree
diff --git a/.doctrees/source/hezar.models.image2text.crnn.crnn_image2text_config.doctree b/.doctrees/source/hezar.models.image2text.crnn.crnn_image2text_config.doctree
diff --git a/.doctrees/source/hezar.models.image2text.crnn.doctree b/.doctrees/source/hezar.models.image2text.crnn.doctree
diff --git a/.doctrees/source/hezar.models.image2text.doctree b/.doctrees/source/hezar.models.image2text.doctree
diff --git a/.doctrees/source/hezar.models.image2text.trocr.doctree b/.doctrees/source/hezar.models.image2text.trocr.doctree
diff --git a/.doctrees/source/hezar.models.image2text.trocr.trocr_image2text.doctree b/.doctrees/source/hezar.models.image2text.trocr.trocr_image2text.doctree
diff --git a/.doctrees/source/hezar.models.image2text.trocr.trocr_image2text_config.doctree b/.doctrees/source/hezar.models.image2text.trocr.trocr_image2text_config.doctree
diff --git a/.doctrees/source/hezar.models.image2text.vit_gpt2.doctree b/.doctrees/source/hezar.models.image2text.vit_gpt2.doctree
diff --git a/.doctrees/source/hezar.models.image2text.vit_gpt2.vit_gpt2_image2text.doctree b/.doctrees/source/hezar.models.image2text.vit_gpt2.vit_gpt2_image2text.doctree
diff --git a/.doctrees/source/hezar.models.image2text.vit_gpt2.vit_gpt2_image2text_config.doctree b/.doctrees/source/hezar.models.image2text.vit_gpt2.vit_gpt2_image2text_config.doctree
diff --git a/.doctrees/source/hezar.models.image2text.vit_roberta.doctree b/.doctrees/source/hezar.models.image2text.vit_roberta.doctree
diff --git a/.doctrees/source/hezar.models.image2text.vit_roberta.vit_roberta_image2text.doctree b/.doctrees/source/hezar.models.image2text.vit_roberta.vit_roberta_image2text.doctree
diff --git a/.doctrees/source/hezar.models.image2text.vit_roberta.vit_roberta_image2text_config.doctree b/.doctrees/source/hezar.models.image2text.vit_roberta.vit_roberta_image2text_config.doctree
diff --git a/.doctrees/source/hezar.models.mask_filling.bert.bert_mask_filling.doctree b/.doctrees/source/hezar.models.mask_filling.bert.bert_mask_filling.doctree
diff --git a/.doctrees/source/hezar.models.mask_filling.bert.bert_mask_filling_config.doctree b/.doctrees/source/hezar.models.mask_filling.bert.bert_mask_filling_config.doctree
diff --git a/.doctrees/source/hezar.models.mask_filling.bert.doctree b/.doctrees/source/hezar.models.mask_filling.bert.doctree
diff --git a/.doctrees/source/hezar.models.mask_filling.distilbert.distilbert_mask_filling.doctree b/.doctrees/source/hezar.models.mask_filling.distilbert.distilbert_mask_filling.doctree
diff --git a/.doctrees/source/hezar.models.mask_filling.distilbert.distilbert_mask_filling_config.doctree b/.doctrees/source/hezar.models.mask_filling.distilbert.distilbert_mask_filling_config.doctree
diff --git a/.doctrees/source/hezar.models.mask_filling.distilbert.doctree b/.doctrees/source/hezar.models.mask_filling.distilbert.doctree
diff --git a/.doctrees/source/hezar.models.mask_filling.doctree b/.doctrees/source/hezar.models.mask_filling.doctree
diff --git a/.doctrees/source/hezar.models.mask_filling.roberta.doctree b/.doctrees/source/hezar.models.mask_filling.roberta.doctree
diff --git a/.doctrees/source/hezar.models.mask_filling.roberta.roberta_mask_filling.doctree b/.doctrees/source/hezar.models.mask_filling.roberta.roberta_mask_filling.doctree
diff --git a/.doctrees/source/hezar.models.mask_filling.roberta.roberta_mask_filling_config.doctree b/.doctrees/source/hezar.models.mask_filling.roberta.roberta_mask_filling_config.doctree
diff --git a/.doctrees/source/hezar.models.model.doctree b/.doctrees/source/hezar.models.model.doctree
diff --git a/.doctrees/source/hezar.models.model_outputs.doctree b/.doctrees/source/hezar.models.model_outputs.doctree
diff --git a/.doctrees/source/hezar.models.sequence_labeling.bert.bert_sequence_labeling.doctree b/.doctrees/source/hezar.models.sequence_labeling.bert.bert_sequence_labeling.doctree
diff --git a/.doctrees/source/hezar.models.sequence_labeling.bert.bert_sequence_labeling_config.doctree b/.doctrees/source/hezar.models.sequence_labeling.bert.bert_sequence_labeling_config.doctree
diff --git a/.doctrees/source/hezar.models.sequence_labeling.bert.doctree b/.doctrees/source/hezar.models.sequence_labeling.bert.doctree
diff --git a/...ees/source/hezar.models.sequence_labeling.distilbert.distilbert_sequence_labeling.doctree b/...ees/source/hezar.models.sequence_labeling.distilbert.distilbert_sequence_labeling.doctree
diff --git a/...rce/hezar.models.sequence_labeling.distilbert.distilbert_sequence_labeling_config.doctree b/...rce/hezar.models.sequence_labeling.distilbert.distilbert_sequence_labeling_config.doctree
diff --git a/.doctrees/source/hezar.models.sequence_labeling.distilbert.doctree b/.doctrees/source/hezar.models.sequence_labeling.distilbert.doctree
diff --git a/.doctrees/source/hezar.models.sequence_labeling.doctree b/.doctrees/source/hezar.models.sequence_labeling.doctree
diff --git a/.doctrees/source/hezar.models.sequence_labeling.roberta.doctree b/.doctrees/source/hezar.models.sequence_labeling.roberta.doctree
diff --git a/.doctrees/source/hezar.models.sequence_labeling.roberta.roberta_sequence_labeling.doctree b/.doctrees/source/hezar.models.sequence_labeling.roberta.roberta_sequence_labeling.doctree
diff --git a/...es/source/hezar.models.sequence_labeling.roberta.roberta_sequence_labeling_config.doctree b/...es/source/hezar.models.sequence_labeling.roberta.roberta_sequence_labeling_config.doctree
diff --git a/.doctrees/source/hezar.models.speech_recognition.doctree b/.doctrees/source/hezar.models.speech_recognition.doctree
diff --git a/.doctrees/source/hezar.models.speech_recognition.whisper.doctree b/.doctrees/source/hezar.models.speech_recognition.whisper.doctree
diff --git a/.doctrees/source/hezar.models.speech_recognition.whisper.whisper_feature_extractor.doctree b/.doctrees/source/hezar.models.speech_recognition.whisper.whisper_feature_extractor.doctree
diff --git a/.doctrees/source/hezar.models.speech_recognition.whisper.whisper_speech_recognition.doctree b/.doctrees/source/hezar.models.speech_recognition.whisper.whisper_speech_recognition.doctree
diff --git a/.../source/hezar.models.speech_recognition.whisper.whisper_speech_recognition_config.doctree b/.../source/hezar.models.speech_recognition.whisper.whisper_speech_recognition_config.doctree
diff --git a/.doctrees/source/hezar.models.speech_recognition.whisper.whisper_tokenizer.doctree b/.doctrees/source/hezar.models.speech_recognition.whisper.whisper_tokenizer.doctree
diff --git a/.doctrees/source/hezar.models.text_classification.bert.bert_text_classification.doctree b/.doctrees/source/hezar.models.text_classification.bert.bert_text_classification.doctree
diff --git a/...rees/source/hezar.models.text_classification.bert.bert_text_classification_config.doctree b/...rees/source/hezar.models.text_classification.bert.bert_text_classification_config.doctree
diff --git a/.doctrees/source/hezar.models.text_classification.bert.doctree b/.doctrees/source/hezar.models.text_classification.bert.doctree
diff --git a/...source/hezar.models.text_classification.distilbert.distilbert_text_classification.doctree b/...source/hezar.models.text_classification.distilbert.distilbert_text_classification.doctree
diff --git a/...hezar.models.text_classification.distilbert.distilbert_text_classification_config.doctree b/...hezar.models.text_classification.distilbert.distilbert_text_classification_config.doctree
diff --git a/.doctrees/source/hezar.models.text_classification.distilbert.doctree b/.doctrees/source/hezar.models.text_classification.distilbert.doctree
diff --git a/.doctrees/source/hezar.models.text_classification.doctree b/.doctrees/source/hezar.models.text_classification.doctree
diff --git a/.doctrees/source/hezar.models.text_classification.roberta.doctree b/.doctrees/source/hezar.models.text_classification.roberta.doctree
diff --git a/...trees/source/hezar.models.text_classification.roberta.roberta_text_classification.doctree b/...trees/source/hezar.models.text_classification.roberta.roberta_text_classification.doctree
diff --git a/...ource/hezar.models.text_classification.roberta.roberta_text_classification_config.doctree b/...ource/hezar.models.text_classification.roberta.roberta_text_classification_config.doctree
diff --git a/.doctrees/source/hezar.models.text_embedding.doctree b/.doctrees/source/hezar.models.text_embedding.doctree
diff --git a/.doctrees/source/hezar.models.text_generation.doctree b/.doctrees/source/hezar.models.text_generation.doctree
diff --git a/.doctrees/source/hezar.models.text_generation.gpt2.doctree b/.doctrees/source/hezar.models.text_generation.gpt2.doctree
diff --git a/.doctrees/source/hezar.models.text_generation.gpt2.gpt2_text_generation.doctree b/.doctrees/source/hezar.models.text_generation.gpt2.gpt2_text_generation.doctree
diff --git a/.doctrees/source/hezar.models.text_generation.gpt2.gpt2_text_generation_config.doctree b/.doctrees/source/hezar.models.text_generation.gpt2.gpt2_text_generation_config.doctree
diff --git a/.doctrees/source/hezar.models.text_generation.t5.doctree b/.doctrees/source/hezar.models.text_generation.t5.doctree
diff --git a/.doctrees/source/hezar.models.text_generation.t5.t5_text_generation.doctree b/.doctrees/source/hezar.models.text_generation.t5.t5_text_generation.doctree
diff --git a/.doctrees/source/hezar.models.text_generation.t5.t5_text_generation_config.doctree b/.doctrees/source/hezar.models.text_generation.t5.t5_text_generation_config.doctree
diff --git a/.doctrees/source/hezar.preprocessors.audio_feature_extractor.doctree b/.doctrees/source/hezar.preprocessors.audio_feature_extractor.doctree
diff --git a/.doctrees/source/hezar.preprocessors.doctree b/.doctrees/source/hezar.preprocessors.doctree
diff --git a/.doctrees/source/hezar.preprocessors.image_processor.doctree b/.doctrees/source/hezar.preprocessors.image_processor.doctree
diff --git a/.doctrees/source/hezar.preprocessors.preprocessor.doctree b/.doctrees/source/hezar.preprocessors.preprocessor.doctree
diff --git a/.doctrees/source/hezar.preprocessors.text_normalizer.doctree b/.doctrees/source/hezar.preprocessors.text_normalizer.doctree
diff --git a/.doctrees/source/hezar.preprocessors.tokenizers.bpe.doctree b/.doctrees/source/hezar.preprocessors.tokenizers.bpe.doctree
diff --git a/.doctrees/source/hezar.preprocessors.tokenizers.doctree b/.doctrees/source/hezar.preprocessors.tokenizers.doctree
diff --git a/.doctrees/source/hezar.preprocessors.tokenizers.sentencepiece_bpe.doctree b/.doctrees/source/hezar.preprocessors.tokenizers.sentencepiece_bpe.doctree
diff --git a/.doctrees/source/hezar.preprocessors.tokenizers.sentencepiece_unigram.doctree b/.doctrees/source/hezar.preprocessors.tokenizers.sentencepiece_unigram.doctree
diff --git a/.doctrees/source/hezar.preprocessors.tokenizers.tokenizer.doctree b/.doctrees/source/hezar.preprocessors.tokenizers.tokenizer.doctree
diff --git a/.doctrees/source/hezar.preprocessors.tokenizers.wordpiece.doctree b/.doctrees/source/hezar.preprocessors.tokenizers.wordpiece.doctree
diff --git a/.doctrees/source/hezar.registry.doctree b/.doctrees/source/hezar.registry.doctree
diff --git a/.doctrees/source/hezar.trainer.doctree b/.doctrees/source/hezar.trainer.doctree
diff --git a/.doctrees/source/hezar.trainer.metrics_handlers.doctree b/.doctrees/source/hezar.trainer.metrics_handlers.doctree
diff --git a/.doctrees/source/hezar.trainer.trainer.doctree b/.doctrees/source/hezar.trainer.trainer.doctree
diff --git a/.doctrees/source/hezar.trainer.trainer_utils.doctree b/.doctrees/source/hezar.trainer.trainer_utils.doctree
diff --git a/.doctrees/source/hezar.utils.audio_utils.doctree b/.doctrees/source/hezar.utils.audio_utils.doctree
diff --git a/.doctrees/source/hezar.utils.common_utils.doctree b/.doctrees/source/hezar.utils.common_utils.doctree
diff --git a/.doctrees/source/hezar.utils.data_utils.doctree b/.doctrees/source/hezar.utils.data_utils.doctree
diff --git a/.doctrees/source/hezar.utils.doctree b/.doctrees/source/hezar.utils.doctree
diff --git a/.doctrees/source/hezar.utils.file_utils.doctree b/.doctrees/source/hezar.utils.file_utils.doctree
diff --git a/.doctrees/source/hezar.utils.hub_utils.doctree b/.doctrees/source/hezar.utils.hub_utils.doctree
diff --git a/.doctrees/source/hezar.utils.image_utils.doctree b/.doctrees/source/hezar.utils.image_utils.doctree
diff --git a/.doctrees/source/hezar.utils.integration_utils.doctree b/.doctrees/source/hezar.utils.integration_utils.doctree
diff --git a/.doctrees/source/hezar.utils.logging.doctree b/.doctrees/source/hezar.utils.logging.doctree
diff --git a/.doctrees/source/hezar.utils.registry_utils.doctree b/.doctrees/source/hezar.utils.registry_utils.doctree
diff --git a/.doctrees/source/index.doctree b/.doctrees/source/index.doctree
diff --git a/.doctrees/source/modules.doctree b/.doctrees/source/modules.doctree
diff --git a/.doctrees/tutorial/datasets.doctree b/.doctrees/tutorial/datasets.doctree
diff --git a/.doctrees/tutorial/embeddings.doctree b/.doctrees/tutorial/embeddings.doctree
diff --git a/.doctrees/tutorial/index.doctree b/.doctrees/tutorial/index.doctree
diff --git a/.doctrees/tutorial/models.doctree b/.doctrees/tutorial/models.doctree
diff --git a/.doctrees/tutorial/preprocessors.doctree b/.doctrees/tutorial/preprocessors.doctree
diff --git a/.doctrees/tutorial/training.doctree b/.doctrees/tutorial/training.doctree
diff --git a/.nojekyll b/.nojekyll
diff --git a/_modules/hezar/builders.html b/_modules/hezar/builders.html
diff --git a/_modules/hezar/configs.html b/_modules/hezar/configs.html
diff --git a/_modules/hezar/constants.html b/_modules/hezar/constants.html
diff --git a/_modules/hezar/data/data_collators.html b/_modules/hezar/data/data_collators.html
diff --git a/_modules/hezar/data/datasets/dataset.html b/_modules/hezar/data/datasets/dataset.html
diff --git a/_modules/hezar/data/datasets/image_captioning_dataset.html b/_modules/hezar/data/datasets/image_captioning_dataset.html
diff --git a/_modules/hezar/data/datasets/ocr_dataset.html b/_modules/hezar/data/datasets/ocr_dataset.html
diff --git a/_modules/hezar/data/datasets/sequence_labeling_dataset.html b/_modules/hezar/data/datasets/sequence_labeling_dataset.html
diff --git a/_modules/hezar/data/datasets/text_classification_dataset.html b/_modules/hezar/data/datasets/text_classification_dataset.html
diff --git a/_modules/hezar/data/datasets/text_summarization_dataset.html b/_modules/hezar/data/datasets/text_summarization_dataset.html
diff --git a/_modules/hezar/embeddings/embedding.html b/_modules/hezar/embeddings/embedding.html
diff --git a/_modules/hezar/embeddings/fasttext.html b/_modules/hezar/embeddings/fasttext.html
diff --git a/_modules/hezar/embeddings/word2vec.html b/_modules/hezar/embeddings/word2vec.html
diff --git a/_modules/hezar/metrics/accuracy.html b/_modules/hezar/metrics/accuracy.html
diff --git a/_modules/hezar/metrics/bleu.html b/_modules/hezar/metrics/bleu.html
diff --git a/_modules/hezar/metrics/cer.html b/_modules/hezar/metrics/cer.html
diff --git a/_modules/hezar/metrics/f1.html b/_modules/hezar/metrics/f1.html
diff --git a/_modules/hezar/metrics/metric.html b/_modules/hezar/metrics/metric.html
diff --git a/_modules/hezar/metrics/precision.html b/_modules/hezar/metrics/precision.html
diff --git a/_modules/hezar/metrics/recall.html b/_modules/hezar/metrics/recall.html
diff --git a/_modules/hezar/metrics/rouge.html b/_modules/hezar/metrics/rouge.html
diff --git a/_modules/hezar/metrics/seqeval.html b/_modules/hezar/metrics/seqeval.html
diff --git a/_modules/hezar/metrics/wer.html b/_modules/hezar/metrics/wer.html
diff --git a/_modules/hezar/models/backbone/bert/bert.html b/_modules/hezar/models/backbone/bert/bert.html
diff --git a/_modules/hezar/models/backbone/bert/bert_config.html b/_modules/hezar/models/backbone/bert/bert_config.html
diff --git a/_modules/hezar/models/backbone/distilbert/distilbert.html b/_modules/hezar/models/backbone/distilbert/distilbert.html
diff --git a/_modules/hezar/models/backbone/distilbert/distilbert_config.html b/_modules/hezar/models/backbone/distilbert/distilbert_config.html
diff --git a/_modules/hezar/models/backbone/roberta/roberta.html b/_modules/hezar/models/backbone/roberta/roberta.html
diff --git a/_modules/hezar/models/backbone/roberta/roberta_config.html b/_modules/hezar/models/backbone/roberta/roberta_config.html
diff --git a/_modules/hezar/models/backbone/vit/vit.html b/_modules/hezar/models/backbone/vit/vit.html
diff --git a/_modules/hezar/models/backbone/vit/vit_config.html b/_modules/hezar/models/backbone/vit/vit_config.html
diff --git a/_modules/hezar/models/image2text/beit_roberta/beit_roberta_image2text.html b/_modules/hezar/models/image2text/beit_roberta/beit_roberta_image2text.html
diff --git a/_modules/hezar/models/image2text/beit_roberta/beit_roberta_image2text_config.html b/_modules/hezar/models/image2text/beit_roberta/beit_roberta_image2text_config.html
diff --git a/_modules/hezar/models/image2text/crnn/crnn_decode_utils.html b/_modules/hezar/models/image2text/crnn/crnn_decode_utils.html
diff --git a/_modules/hezar/models/image2text/crnn/crnn_image2text.html b/_modules/hezar/models/image2text/crnn/crnn_image2text.html
diff --git a/_modules/hezar/models/image2text/crnn/crnn_image2text_config.html b/_modules/hezar/models/image2text/crnn/crnn_image2text_config.html
diff --git a/_modules/hezar/models/image2text/trocr/trocr_image2text.html b/_modules/hezar/models/image2text/trocr/trocr_image2text.html
diff --git a/_modules/hezar/models/image2text/trocr/trocr_image2text_config.html b/_modules/hezar/models/image2text/trocr/trocr_image2text_config.html
diff --git a/_modules/hezar/models/image2text/vit_gpt2/vit_gpt2_image2text.html b/_modules/hezar/models/image2text/vit_gpt2/vit_gpt2_image2text.html
diff --git a/_modules/hezar/models/image2text/vit_gpt2/vit_gpt2_image2text_config.html b/_modules/hezar/models/image2text/vit_gpt2/vit_gpt2_image2text_config.html
diff --git a/_modules/hezar/models/image2text/vit_roberta/vit_roberta_image2text.html b/_modules/hezar/models/image2text/vit_roberta/vit_roberta_image2text.html
diff --git a/_modules/hezar/models/image2text/vit_roberta/vit_roberta_image2text_config.html b/_modules/hezar/models/image2text/vit_roberta/vit_roberta_image2text_config.html
diff --git a/_modules/hezar/models/model.html b/_modules/hezar/models/model.html
diff --git a/_modules/hezar/models/model_outputs.html b/_modules/hezar/models/model_outputs.html
diff --git a/_modules/hezar/models/sequence_labeling/bert/bert_sequence_labeling.html b/_modules/hezar/models/sequence_labeling/bert/bert_sequence_labeling.html
diff --git a/_modules/hezar/models/sequence_labeling/bert/bert_sequence_labeling_config.html b/_modules/hezar/models/sequence_labeling/bert/bert_sequence_labeling_config.html
diff --git a/_modules/hezar/models/sequence_labeling/distilbert/distilbert_sequence_labeling.html b/_modules/hezar/models/sequence_labeling/distilbert/distilbert_sequence_labeling.html
diff --git a/_modules/hezar/models/sequence_labeling/distilbert/distilbert_sequence_labeling_config.html b/_modules/hezar/models/sequence_labeling/distilbert/distilbert_sequence_labeling_config.html
diff --git a/_modules/hezar/models/sequence_labeling/roberta/roberta_sequence_labeling.html b/_modules/hezar/models/sequence_labeling/roberta/roberta_sequence_labeling.html
diff --git a/_modules/hezar/models/sequence_labeling/roberta/roberta_sequence_labeling_config.html b/_modules/hezar/models/sequence_labeling/roberta/roberta_sequence_labeling_config.html
diff --git a/_modules/hezar/models/speech_recognition/whisper/whisper_feature_extractor.html b/_modules/hezar/models/speech_recognition/whisper/whisper_feature_extractor.html
diff --git a/_modules/hezar/models/speech_recognition/whisper/whisper_speech_recognition.html b/_modules/hezar/models/speech_recognition/whisper/whisper_speech_recognition.html
diff --git a/_modules/hezar/models/speech_recognition/whisper/whisper_speech_recognition_config.html b/_modules/hezar/models/speech_recognition/whisper/whisper_speech_recognition_config.html
diff --git a/_modules/hezar/models/speech_recognition/whisper/whisper_tokenizer.html b/_modules/hezar/models/speech_recognition/whisper/whisper_tokenizer.html
diff --git a/_modules/hezar/models/text_classification/bert/bert_text_classification.html b/_modules/hezar/models/text_classification/bert/bert_text_classification.html
diff --git a/_modules/hezar/models/text_classification/bert/bert_text_classification_config.html b/_modules/hezar/models/text_classification/bert/bert_text_classification_config.html
diff --git a/_modules/hezar/models/text_classification/distilbert/distilbert_text_classification.html b/_modules/hezar/models/text_classification/distilbert/distilbert_text_classification.html
diff --git a/...es/hezar/models/text_classification/distilbert/distilbert_text_classification_config.html b/...es/hezar/models/text_classification/distilbert/distilbert_text_classification_config.html
diff --git a/_modules/hezar/models/text_classification/roberta/roberta_text_classification.html b/_modules/hezar/models/text_classification/roberta/roberta_text_classification.html
diff --git a/_modules/hezar/models/text_classification/roberta/roberta_text_classification_config.html b/_modules/hezar/models/text_classification/roberta/roberta_text_classification_config.html
diff --git a/_modules/hezar/models/text_generation/gpt2/gpt2_text_generation.html b/_modules/hezar/models/text_generation/gpt2/gpt2_text_generation.html
diff --git a/_modules/hezar/models/text_generation/gpt2/gpt2_text_generation_config.html b/_modules/hezar/models/text_generation/gpt2/gpt2_text_generation_config.html
diff --git a/_modules/hezar/models/text_generation/t5/t5_text_generation.html b/_modules/hezar/models/text_generation/t5/t5_text_generation.html
diff --git a/_modules/hezar/models/text_generation/t5/t5_text_generation_config.html b/_modules/hezar/models/text_generation/t5/t5_text_generation_config.html
diff --git a/_modules/hezar/preprocessors/audio_feature_extractor.html b/_modules/hezar/preprocessors/audio_feature_extractor.html
diff --git a/_modules/hezar/preprocessors/image_processor.html b/_modules/hezar/preprocessors/image_processor.html
diff --git a/_modules/hezar/preprocessors/preprocessor.html b/_modules/hezar/preprocessors/preprocessor.html
diff --git a/_modules/hezar/preprocessors/text_normalizer.html b/_modules/hezar/preprocessors/text_normalizer.html
diff --git a/_modules/hezar/preprocessors/tokenizers/bpe.html b/_modules/hezar/preprocessors/tokenizers/bpe.html
diff --git a/_modules/hezar/preprocessors/tokenizers/sentencepiece_bpe.html b/_modules/hezar/preprocessors/tokenizers/sentencepiece_bpe.html
diff --git a/_modules/hezar/preprocessors/tokenizers/sentencepiece_unigram.html b/_modules/hezar/preprocessors/tokenizers/sentencepiece_unigram.html
diff --git a/_modules/hezar/preprocessors/tokenizers/tokenizer.html b/_modules/hezar/preprocessors/tokenizers/tokenizer.html
diff --git a/_modules/hezar/preprocessors/tokenizers/wordpiece.html b/_modules/hezar/preprocessors/tokenizers/wordpiece.html
diff --git a/_modules/hezar/registry.html b/_modules/hezar/registry.html
diff --git a/_modules/hezar/trainer/metrics_handlers.html b/_modules/hezar/trainer/metrics_handlers.html
diff --git a/_modules/hezar/trainer/trainer.html b/_modules/hezar/trainer/trainer.html
diff --git a/_modules/hezar/trainer/trainer_utils.html b/_modules/hezar/trainer/trainer_utils.html
diff --git a/_modules/hezar/utils/audio_utils.html b/_modules/hezar/utils/audio_utils.html
diff --git a/_modules/hezar/utils/common_utils.html b/_modules/hezar/utils/common_utils.html
diff --git a/_modules/hezar/utils/data_utils.html b/_modules/hezar/utils/data_utils.html
diff --git a/_modules/hezar/utils/file_utils.html b/_modules/hezar/utils/file_utils.html
diff --git a/_modules/hezar/utils/hub_utils.html b/_modules/hezar/utils/hub_utils.html
diff --git a/_modules/hezar/utils/image_utils.html b/_modules/hezar/utils/image_utils.html
diff --git a/_modules/hezar/utils/integration_utils.html b/_modules/hezar/utils/integration_utils.html
diff --git a/_modules/hezar/utils/logging.html b/_modules/hezar/utils/logging.html
diff --git a/_modules/hezar/utils/registry_utils.html b/_modules/hezar/utils/registry_utils.html
diff --git a/_modules/index.html b/_modules/index.html
diff --git a/_sources/contributing.md.txt b/_sources/contributing.md.txt
@@ -0,0 +1,102 @@
+# Contributing to Hezar
+Welcome to Hezar! We greatly appreciate your interest in contributing to this project and helping us make it even more
+valuable to the Persian community. Whether you're a developer, researcher, or enthusiast, your contributions are
+invaluable in helping us grow and improve Hezar.
+
+Before you start contributing, please take a moment to review the following guidelines.
+
+## Code of Conduct
+
+This project and its community adhere to
+the [Contributor Code of Conduct](https://github.com/hezarai/hezar/blob/main/CODE_OF_CONDUCT.md).
+
+## How to Contribute
+
+### Reporting Bugs
+
+If you come across a bug or unexpected behavior, please help us by reporting it.
+Use the [GitHub Issue Tracker](https://github.com/hezarai/hezar/issues) to create a detailed bug report.
+Include information such as:
+
+- A clear and descriptive title.
+- Steps to reproduce the bug.
+- Expected behavior.
+- Actual behavior.
+- Your operating system and Python version.
+
+### Adding features
+
+Have a great idea for a new feature or improvement? We'd love to hear it. You can open an issue and add your suggestion
+with a clear description and further suggestions on how it can be implemented. Also, if you already can implement it
+yourself, just follow the instructions on how you can send a PR.
+
+### Adding/Improving documents
+
+Have a suggestion to enhance our documentation or want to contribute entirely new sections? We welcome your input!<br>
+Here's how you can get involved:<br>
+Docs website is deployed here: [https://hezarai.github.io/hezar](https://hezarai.github.io/hezar) and the source for the
+docs are located at the [docs](https://github.com/hezarai/hezar/tree/main/docs) folder in the root of the repo. Feel
+free to apply your changes or add new docs to this section. Notice that docs are written in Markdown format. In case you have
+added new files to this section, you must include them in the `index.md` file in the same folder. For example, if you've
+added the file `new_doc.md` to the `get_started` folder, you have to modify `get_started/index.md` and put your file
+name there.
+
+### Commit guidelines
+
+#### Functional best practices
+
+- Ensure only one "logical change" per commit for efficient review and flaw identification.
+- Smaller code changes facilitate quicker reviews and easier troubleshooting using Git's bisect capability.
+- Avoid mixing whitespace changes with functional code changes.
+- Avoid mixing two unrelated functional changes.
+- Refrain from sending large new features in a single giant commit.
+
+#### Styling best practices
+
+- Use imperative mood in the subject (e.g., "Add support for ..." not "Adding support or added support") .
+- Keep the subject line short and concise, preferably less than 50 characters.
+- Capitalize the subject line and do not end it with a period.
+- Wrap body lines at 72 characters.
+- Use the body to explain what and why a change was made.
+- Do not explain the "how" in the commit message; reserve it for documentation or code.
+- For commits referencing an issue or pull request, write the proper commit subject followed by the reference in
+  parentheses (e.g., "Add NFKC normalizer (#9999)").
+- Reference codes & paths in back quotes (e.g., `variable`, `method()`, `Class()`, `file.py`).
+- Preferably use the following [gitmoji](https://gitmoji.dev/) compatible codes at the beginning of your commit message:
+
+| Emoji Code           | Emoji | Description                                  | Example Commit                                                 |
+|----------------------|-------|----------------------------------------------|----------------------------------------------------------------|
+| `:bug:`              | 🐛    | Fix a bug or issue                           | `:bug: Fix issue with image loading in DataLoader`             |
+| `:sparkles:`         | ✨     | Add feature or improvements                  | `:sparkles: Introduce support for text summarization`          |
+| `:recycle:`          | ♻️    | Refactor code (backward compatible refactor) | `:recycle: Refactor data preprocessing utilities`              |
+| `:memo:`             | 📝    | Add or change docs                           | `:memo: Update documentation for text classification`          |
+| `:pencil2:`          | ✏️    | Minor change or improvement                  | `:pencil2: Improve logging in Trainer`                         |
+| `:fire:`             | 🔥    | Remove code or file                          | `:fire: Remove outdated utility function`                      |
+| `:boom:`             | 💥    | Introduce breaking changes                   | `:boom: Update API, requires modification in existing scripts` |
+| `:test_tube:`        | 🧪    | Test-related changes                         | `:test_tube: Add unit tests for data loading functions`        |
+| `:bookmark:`         | 🔖    | Version release                              | `:bookmark: Release v1.0.0`                                    |
+| `:adhesive_bandage:` | 🩹    | Non-critical fix                             | `:adhesive_bandage: Fix minor issue in BPE tokenizer`          |
+
+## Sending a PR
+
+In order to apply any change to the repo, you have to follow these step:
+
+1. Fork the Hezar repository.
+2. Create a new branch for your feature, bug fix, etc.
+3. Make your changes.
+4. Update the documentation to reflect your changes.
+5. Ensure your code adheres to the [Google Python Style Guide](https://google.github.io/styleguide/pyguide.html).
+6. Format the code using `ruff` (`ruff check --fix .`)
+7. Write tests to ensure the functionality if needed.
+8. Run tests and make sure all of them pass. (Skip this step if your changes do not involve codes)
+9. Open a pull request from your fork and the PR template will be automatically loaded to help you do the rest.
+10. Be responsive to feedback and comments during the review process.
+11. Thanks for contributing to the Hezar project.😉❤️
+
+## License
+
+By contributing to Hezar, you agree that your contributions will be licensed under
+the [Apache 2.0 License](https://github.com/hezarai/hezar/blob/main/LICENSE).
+
+We look forward to your contributions and appreciate your efforts in making Hezar a powerful AI tool for the Persian
+community!
diff --git a/_sources/get_started/index.md.txt b/_sources/get_started/index.md.txt
@@ -0,0 +1,8 @@
+# Get Started
+```{toctree}
+:maxdepth: 1
+
+overview.md
+installation.md
+quick_tour.md
+```
diff --git a/_sources/get_started/installation.md.txt b/_sources/get_started/installation.md.txt
@@ -0,0 +1,41 @@
+# Installation
+
+## Install from PyPi
+Installing Hezar is as easy as any other Python library! Most of the requirements are cross-platform and installing
+them on any machine is a piece of cake!
+
+```
+pip install hezar
+```
+### Installation variations
+Hezar is packed with a lot of tools that are dependent on other packages. Most of the
+time you might not want everything to be installed, hence, providing multiple variations of
+Hezar so that the installation is light and fast for general use.
+
+You can install optional dependencies for each mode like so:
+```
+pip install hezar[nlp]  # For natural language processing
+pip install hezar[vision]  # For computer vision and image processing
+pip install hezar[audio]  # For audio and speech processing
+pip install hezar[embeddings]  # For word embeddings
+```
+Or you can also install everything using:
+```
+pip install hezar[all]
+```
+## Install from source
+Also, you can install the dev version of the library using the source:
+```
+pip install git+https://github.com/hezarai/hezar.git
+```
+
+## Test installation
+From a Python console or in CLI just import `hezar` and check the version:
+```python
+import hezar
+
+print(hezar.__version__)
+```
+```
+0.23.1
+```
diff --git a/_sources/get_started/overview.md.txt b/_sources/get_started/overview.md.txt
@@ -0,0 +1,20 @@
+# Overview
+
+Welcome to Hezar! A library that makes state-of-the-art machine learning as easy as possible aimed for the Persian
+language, built by the Persian community!
+
+In Hezar, the primary goal is to provide plug-and-play AI/ML utilities so that you don't need to know much about what's
+going on under the hood. Hezar is not just a model library, but instead it's packed with every aspect you need for any
+ML pipeline like datasets, trainers, preprocessors, feature extractors, etc.
+
+Hezar is a library that:
+- brings together all the best works in AI for Persian
+- makes using AI models as easy as a couple of lines of code
+- seamlessly integrates with Hugging Face Hub for all of its models
+- has a highly developer-friendly interface
+- has a task-based model interface which is more convenient for general users.
+- is packed with additional tools like word embeddings, tokenizers, feature extractors, etc.
+- comes with a lot of supplementary ML tools for deployment, benchmarking, optimization, etc.
+- and more!
+
+To find out more, just take the [quick tour](quick_tour.md)!
diff --git a/_sources/get_started/quick_tour.md.txt b/_sources/get_started/quick_tour.md.txt
@@ -0,0 +1,214 @@
+# Quick Tour
+## Models
+There's a bunch of ready to use trained models for different tasks on the Hub!
+
+**🤗Hugging Face Hub Page**: [https://huggingface.co/hezarai](https://huggingface.co/hezarai)
+
+Let's walk you through some examples!
+
+- **Text Classification (sentiment analysis, categorization, etc)**
+```python
+from hezar.models import Model
+
+example = ["هزار، کتابخانه‌ای کامل برای به کارگیری آسان هوش مصنوعی"]
+model = Model.load("hezarai/bert-fa-sentiment-dksf")
+outputs = model.predict(example)
+print(outputs)
+```
+```
+[[{'label': 'positive', 'score': 0.812910258769989}]]
+```
+- **Sequence Labeling (POS, NER, etc.)**
+```python
+from hezar.models import Model
+
+pos_model = Model.load("hezarai/bert-fa-pos-lscp-500k")  # Part-of-speech
+ner_model = Model.load("hezarai/bert-fa-ner-arman")  # Named entity recognition
+inputs = ["شرکت هوش مصنوعی هزار"]
+pos_outputs = pos_model.predict(inputs)
+ner_outputs = ner_model.predict(inputs)
+print(f"POS: {pos_outputs}")
+print(f"NER: {ner_outputs}")
+```
+```
+POS: [[{'token': 'شرکت', 'label': 'Ne'}, {'token': 'هوش', 'label': 'Ne'}, {'token': 'مصنوعی', 'label': 'AJe'}, {'token': 'هزار', 'label': 'NUM'}]]
+NER: [[{'token': 'شرکت', 'label': 'B-org'}, {'token': 'هوش', 'label': 'I-org'}, {'token': 'مصنوعی', 'label': 'I-org'}, {'token': 'هزار', 'label': 'I-org'}]]
+```
+- **Language Modeling (Mask Filling)**
+```python
+from hezar.models import Model
+
+roberta_mask_filling = Model.load("hezarai/roberta-fa-mask-filling")
+inputs = ["سلام بچه ها حالتون <mask>"]
+outputs = roberta_mask_filling.predict(inputs, top_k=1)
+print(outputs)
+```
+```
+[[{'token': 'چطوره', 'sequence': 'سلام بچه ها حالتون چطوره', 'token_id': 34505, 'score': 0.2230483442544937}]]
+```
+- **Speech Recognition**
+```python
+from hezar.models import Model
+
+whisper = Model.load("hezarai/whisper-small-fa")
+transcripts = whisper.predict("examples/assets/speech_example.mp3")
+print(transcripts)
+```
+```
+[{'text': 'و این تنها محدود به محیط کار نیست'}]
+```
+- **Image to Text (OCR)**
+```python
+from hezar.models import Model
+# OCR with TrOCR
+model = Model.load("hezarai/trocr-base-fa-v2")
+texts = model.predict(["examples/assets/ocr_example.jpg"])
+print(f"TrOCR Output: {texts}")
+
+# OCR with CRNN
+model = Model.load("hezarai/crnn-fa-printed-96-long")
+texts = model.predict("examples/assets/ocr_example.jpg")
+print(f"CRNN Output: {texts}")
+```
+```
+TrOCR Output: [{'text': 'چه میشه کرد، باید صبر کنیم'}]
+CRNN Output: [{'text': 'چه میشه کرد، باید صبر کنیم'}]
+```
+![](https://raw.githubusercontent.com/hezarai/hezar/main/examples/assets/ocr_example.jpg)
+
+- **Image to Text (License Plate Recognition)**
+```python
+from hezar.models import Model
+
+model = Model.load("hezarai/crnn-fa-64x256-license-plate-recognition")
+plate_text = model.predict("assets/license_plate_ocr_example.jpg")
+print(plate_text)  # Persian text of mixed numbers and characters might not show correctly in the console
+```
+```
+[{'text': '۵۷س۷۷۹۷۷'}]
+```
+![](https://raw.githubusercontent.com/hezarai/hezar/main/examples/assets/license_plate_ocr_example.jpg)
+
+- **Image to Text (Image Captioning)**
+```python
+from hezar.models import Model
+
+model = Model.load("hezarai/vit-roberta-fa-image-captioning-flickr30k")
+texts = model.predict("examples/assets/image_captioning_example.jpg")
+print(texts)
+```
+```
+[{'text': 'سگی با توپ تنیس در دهانش می دود.'}]
+```
+![](https://raw.githubusercontent.com/hezarai/hezar/main/examples/assets/image_captioning_example.jpg)
+
+We constantly keep working on adding and training new models and this section will hopefully be expanding over time ;)
+## Word Embeddings
+- **FastText**
+```python
+from hezar.embeddings import Embedding
+
+fasttext = Embedding.load("hezarai/fasttext-fa-300")
+most_similar = fasttext.most_similar("هزار")
+print(most_similar)
+```
+```
+[{'score': 0.7579, 'word': 'میلیون'},
+ {'score': 0.6943, 'word': '21هزار'},
+ {'score': 0.6861, 'word': 'میلیارد'},
+ {'score': 0.6825, 'word': '26هزار'},
+ {'score': 0.6803, 'word': '٣هزار'}]
+```
+- **Word2Vec (Skip-gram)**
+```python
+from hezar.embeddings import Embedding
+
+word2vec = Embedding.load("hezarai/word2vec-skipgram-fa-wikipedia")
+most_similar = word2vec.most_similar("هزار")
+print(most_similar)
+```
+```
+[{'score': 0.7885, 'word': 'چهارهزار'},
+ {'score': 0.7788, 'word': '۱۰هزار'},
+ {'score': 0.7727, 'word': 'دویست'},
+ {'score': 0.7679, 'word': 'میلیون'},
+ {'score': 0.7602, 'word': 'پانصد'}]
+```
+- **Word2Vec (CBOW)**
+```python
+from hezar.embeddings import Embedding
+
+word2vec = Embedding.load("hezarai/word2vec-cbow-fa-wikipedia")
+most_similar = word2vec.most_similar("هزار")
+print(most_similar)
+```
+```
+[{'score': 0.7407, 'word': 'دویست'},
+ {'score': 0.7400, 'word': 'میلیون'},
+ {'score': 0.7326, 'word': 'صد'},
+ {'score': 0.7276, 'word': 'پانصد'},
+ {'score': 0.7011, 'word': 'سیصد'}]
+```
+For a full guide on the embeddings module, see the [embeddings tutorial](https://hezarai.github.io/hezar/tutorial/embeddings.html).
+## Datasets
+You can load any of the datasets on the [Hub](https://huggingface.co/hezarai) like below:
+```python
+from hezar.data import Dataset
+
+sentiment_dataset = Dataset.load("hezarai/sentiment-dksf")  # A TextClassificationDataset instance
+lscp_dataset = Dataset.load("hezarai/lscp-pos-500k")  # A SequenceLabelingDataset instance
+xlsum_dataset = Dataset.load("hezarai/xlsum-fa")  # A TextSummarizationDataset instance
+alpr_ocr_dataset = Dataset.load("hezarai/persian-license-plate-v1")  # An OCRDataset instance
+...
+```
+The returned dataset objects from `load()` are PyTorch Dataset wrappers for specific tasks and can be used by a data loader out-of-the-box!
+
+You can also load Hezar's datasets using 🤗Datasets:
+```python
+from datasets import load_dataset
+
+dataset = load_dataset("hezarai/sentiment-dksf")
+```
+For a full guide on Hezar's datasets, see the [datasets tutorial](https://hezarai.github.io/hezar/tutorial/datasets.html).
+## Training
+Hezar makes it super easy to train models using out-of-the-box models and datasets provided in the library.
+
+```python
+from hezar.models import BertSequenceLabeling, BertSequenceLabelingConfig
+from hezar.data import Dataset
+from hezar.trainer import Trainer, TrainerConfig
+from hezar.preprocessors import Preprocessor
+
+base_model_path = "hezarai/bert-base-fa"
+dataset_path = "hezarai/lscp-pos-500k"
+
+train_dataset = Dataset.load(dataset_path, split="train", tokenizer_path=base_model_path)
+eval_dataset = Dataset.load(dataset_path, split="test", tokenizer_path=base_model_path)
+
+model = BertSequenceLabeling(BertSequenceLabelingConfig(id2label=train_dataset.config.id2label))
+preprocessor = Preprocessor.load(base_model_path)
+
+train_config = TrainerConfig(
+    output_dir="bert-fa-pos-lscp-500k",
+    task="sequence_labeling",
+    device="cuda",
+    init_weights_from=base_model_path,
+    batch_size=8,
+    num_epochs=5,
+    metrics=["seqeval"],
+)
+
+trainer = Trainer(
+    config=train_config,
+    model=model,
+    train_dataset=train_dataset,
+    eval_dataset=eval_dataset,
+    data_collator=train_dataset.data_collator,
+    preprocessor=preprocessor,
+)
+trainer.train()
+
+trainer.push_to_hub("bert-fa-pos-lscp-500k")  # push model, config, preprocessor, trainer files and configs
+```
+
+Want to go deeper? Check out the [guides](../guide/index.md).
diff --git a/_sources/guide/advanced_training.md.txt b/_sources/guide/advanced_training.md.txt
@@ -0,0 +1,2 @@
+# Advanced Training
+Docs coming soon, stay tuned!