    Repositories list

    • Python
      0152 Updated Jun 17, 2024
    • abyss

      Public
      Jupyter Notebook
      1021 Updated Nov 18, 2022
    • Globus Labs Xtract: Extract metadata from distributed data sets.
      Python
      16230 Updated Aug 4, 2022
    • Python
      0010 Updated May 24, 2022
    • Jupyter notebooks
      Jupyter Notebook
      0010 Updated May 10, 2022
    • Xtract-Sampler Version 2.0 The Sequel!
      Python
      0000 Updated Apr 29, 2022
    • ML code to sample a file based on cheap, easily attainable features of that file.
      Python
      0230 Updated Apr 9, 2022
    • Python
      0000 Updated Apr 8, 2022
    • Python
      0000 Updated Apr 5, 2022
    • Utilities for Xtract development
      Python
      0000 Updated Mar 22, 2022
    • SDK for Xtract
      Python
      MIT License
      0370 Updated Mar 18, 2022
    • Python
      0000 Updated Mar 15, 2022
    • Extractor to wrap the libmagic library.
      Python
      0000 Updated Mar 1, 2022
    • Python
      0000 Updated Feb 27, 2022
    • REU BigDataX Summer 2021
      Jupyter Notebook
      0000 Updated Feb 11, 2022
    • Python
      0010 Updated Jan 11, 2022
    • Python
      0250 Updated Jan 11, 2022
    • Python
      1000 Updated Jan 11, 2022
    • Jupyter Notebook
      0100 Updated Jan 11, 2022
    • Python
      0110 Updated Jan 11, 2022
    • Python
      0000 Updated Jan 11, 2022
    • Python
      0000 Updated Jan 9, 2022
    • Python
      0110 Updated Dec 31, 2021
    • Python
      0020 Updated Dec 22, 2021
    • Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
      Python
      Apache License 2.0
      236000 Updated Dec 10, 2021
    • The Globus, POSIX and S3 crawlers for instantiating metadata extraction jobs over files.
      Python
      00100 Updated Dec 6, 2021
    • Fixed version of mdf-toolbox to be used in Xtracting
      Python
      Apache License 2.0
      0000 Updated Nov 19, 2021
    • Xtract-Sampler but with DL
      Jupyter Notebook
      0000 Updated Jul 15, 2021
    • 0000 Updated Jul 6, 2021
    • The entity that executes the extraction orchestration logic for all file groups.
      0000 Updated Mar 4, 2021