Skip to content
Change the repository type filter

All

    Repositories list

    • sahara

      Public
      Sahara aims to provide users with simple means to provision a data intensive cluster (Hadoop, Spark) by specifying several parameters like software versions, cluster topology, nodes hardware details and a few more.
      Python
      Apache License 2.0
      119200Updated Nov 3, 2015Nov 3, 2015
    • Disk image elements for Savanna
      Shell
      Apache License 2.0
      31000Updated Sep 24, 2015Sep 24, 2015
    • Implementation of a new ROLLUP operator for Apache Pig, that results in optimal execution plans
      Java
      Apache License 2.0
      1000Updated May 7, 2015May 7, 2015
    • pyostack

      Public
      Simple OpenStack Python bindings
      Python
      Apache License 2.0
      0000Updated Apr 9, 2015Apr 9, 2015
    • Python logging handler for Logstash.
      Python
      MIT License
      136000Updated Oct 29, 2014Oct 29, 2014
    • schedsim

      Public
      Python
      Other
      1100Updated Oct 22, 2014Oct 22, 2014
    • treelib

      Public
      Decision trees library and more
      Scala
      Apache License 2.0
      1400Updated Oct 9, 2014Oct 9, 2014
    • pig

      Public
      Mirror of Apache Pig
      Java
      Apache License 2.0
      450000Updated Oct 3, 2014Oct 3, 2014
    • This is the PIG ROLLUP repo
      Java
      Apache License 2.0
      1000Updated Oct 3, 2014Oct 3, 2014
    • MRTRIAGE

      Public
      Python
      2000Updated Sep 22, 2014Sep 22, 2014
    • A possible implementation of a decision tree for SPARK
      Scala
      Apache License 2.0
      2400Updated Sep 12, 2014Sep 12, 2014
    • OSMEF

      Public
      OpenStack Measurement Framework
      Python
      Apache License 2.0
      1300Updated Aug 13, 2014Aug 13, 2014
    • Hadoop implementation of KNN graph building algorithms (Brute force, NNDescent, NNCtph, ...)
      Java
      3000Updated Jul 28, 2014Jul 28, 2014
    • HFSP

      Public
      The Hadoop Fair Sojourn Protocol Scheduler
      Java
      Apache License 2.0
      4100Updated Jan 14, 2014Jan 14, 2014
    • Java
      Apache License 2.0
      7400Updated Oct 15, 2013Oct 15, 2013
    • A set of tools to analyse Hadoop logs
      Python
      Apache License 2.0
      5000Updated Jul 1, 2013Jul 1, 2013
    • This project deals with the implementation of k-means for multi-dimensional clustering.
      Scala
      4000Updated Jun 20, 2013Jun 20, 2013
    • SWIM

      Public
      Statistical Workload Injector for MapReduce - Project at UC Berkeley AMP Lab
      Java
      92000Updated Jun 27, 2012Jun 27, 2012