Infochimps Logo
  • How It Works
  • Technologies
  • Data Marketplace
  • Blog
  • Contact

We’re a Ruby shop that works with big data. Here are the open source tools that we’ve contributed to or built to make this happen.

Many thanks to the other contributors to these projects. If you are one of them, maybe you should join our team.

Wukong is Ruby for Hadoop—it makes Hadoop so easy a chimpanzee can use it.

  • owner
  • contributors
  • main contributor

Cluster_chef is a powerful tool for maintaining and describing the software configurations that let a machine provide its services.

  • owner
  • contributors
  • main contributor

Swineherd is for running scripts and workflows on filesystems.

  • owner
  • contributors
  • main contributor

HbaseBulkloader is a bulkloader for HBase that explores various strategies. Includes Apache Pig load and store functions.

  • owner
  • contributors
  • main contributor

IMW is the Infinite Monkeywrench (IMW) is a Ruby frameworks to simplify the tasks of acquiring, extracting, transforming, loading, and packaging data.

  • owner
  • contributors
  • main contributor

Wonderdog is a bulkloader for Elastic Search. Includes a simple storefunc for Apache Pig.

  • owner
  • contributors
  • main contributor

The ChimpMARK-2010 is a collection of massive real-world data sets, interesting real-world problems, and simple example code to solve them.

  • owner
  • contributors
  • main contributor
About Us
  • About
  • Team
  • Careers
  • Press
Legal
  • Terms of Use
  • Security
  • Privacy
  • Copyright
Infochimps
  • How It Works
  • Technologies
  • Blog
  • Support
Data Marketplace
  • Geo APIs
  • Social APIs
  • Documentation
  • Sign Up/Login
  • Twitter
  • Facebook
  • Github
© 2012 Infochimps, Inc.
All Rights Reserved.