Skip to content

Commit

Permalink
Merge pull request #6 from Machine-Learning-Pipelines/llm
Browse files Browse the repository at this point in the history
Llm
  • Loading branch information
adbX authored Aug 2, 2023
2 parents fca0121 + 610f3bc commit d6057ca
Show file tree
Hide file tree
Showing 47 changed files with 6,155 additions and 3,199 deletions.
4 changes: 3 additions & 1 deletion .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -6,9 +6,11 @@ site/
sphinxdocs/

case-studies/individual/

case-studies/arxiv-corpus/gold_standard_old/
case-studies/arxiv-corpus/gold_standard_test/
client_secrets.json

outputs/
doc/

# Byte-compiled / optimized / DLL files
Expand Down
7 changes: 6 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -22,5 +22,10 @@ Distributed under the MIT License. See `LICENSE.txt` for more information.
### Funding

We thank the The Center for Research and Education in AI and Learning (REAL@USC) for their funding and support towards this project.

Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the The Center for Research and Education in AI and Learning (REAL@USC).

---

This material is based upon work supported by the National Science Foundation under Grant No. OAC 2138773

---
6 changes: 3 additions & 3 deletions case-studies/arxiv-corpus/gold_standard.dvc
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
outs:
- md5: 95f383f75d5092838472870faaf4528d.dir
size: 1356118693
nfiles: 4276
- md5: fa4149515a6f4ab47eb3b373cfc8f815.dir
size: 1353977566
nfiles: 4273
path: gold_standard
107 changes: 54 additions & 53 deletions case-studies/arxiv-corpus/manual_eval.csv

Large diffs are not rendered by default.

Binary file removed case-studies/plots/heatmap.png
Binary file not shown.
Binary file removed case-studies/plots/heatmap_manual_eval.png
Binary file not shown.
Binary file removed case-studies/plots/heatmap_repo.png
Binary file not shown.
Binary file removed case-studies/plots/heatmap_repo_palettes.png
Binary file not shown.
Binary file removed case-studies/plots/workflow.png
Binary file not shown.
Empty file added docs/architecture.md
Empty file.
35 changes: 30 additions & 5 deletions docs/docstrings.md
Original file line number Diff line number Diff line change
@@ -1,10 +1,35 @@
# Docstrings
# Architecture

## Documentation of `tex_eval.py`
## `tex_eval` module

::: src.reproscreener.tex_eval
The `tex_eval` module is used to evaluate `.tex` files exttracted from the arXiv source tarball of the paper.

## Documentation of `repo_eval.py`
::: reproscreener.tex_eval
options:
show_source: false
heading_level: 3

::: src.reproscreener.repo_eval
## `repo_eval` module

::: reproscreener.repo_eval
options:
show_source: false
heading_level: 3

## `scrape_arxiv` module

The `scrape_arxiv` module is used to obtain the gold standard dataset from the arXiv. It includes the PDFs, source tarballs, and abstract for each paper.

::: reproscreener.scrape_arxiv
options:
show_source: false
heading_level: 3

## `gold_standard` module

The `gold_standard` module is used to evaluate and compare the performance of `reproscreener` on the gold standard dataset. It uses the data from the `scrape_arxiv` module.

::: reproscreener.gold_standard
options:
show_source: false
heading_level: 3
914 changes: 798 additions & 116 deletions docs/evaluation_results.ipynb

Large diffs are not rendered by default.

8 changes: 7 additions & 1 deletion docs/funding.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,11 @@
# Funding



We thank the The Center for Research and Education in AI and Learning (REAL@USC) for their funding and support towards this project.

Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the The Center for Research and Education in AI and Learning (REAL@USC).
Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the The Center for Research and Education in AI and Learning (REAL@USC).

---

This material is based upon work supported by the National Science Foundation under Grant No. OAC 2138773
53 changes: 0 additions & 53 deletions docs/manual_eval.csv

This file was deleted.

Loading

0 comments on commit d6057ca

Please sign in to comment.