Crunching BigData with HPCC and Ensemble

Posted by kim0 // August 19th, 2011 // Uncategorized

BigData is a term that has been generating a lot of buzz lately. With the explosion of data sources on the internet, the volume and rate of data generation are simply gigantic. Handling such huge amounts of data requires horizontally scalable software architectures that can store and analyze it efficiently. Hadoop has traditionally filled this niche in the open-source world; more recently, however, HPCC has been trying to take Hadoop’s place as the preferred open-source bigdata hammer.

Installing, managing and scaling such tools, however, is usually a daunting task. HPCC is no exception, but with Ensemble, starting up a multi-node HPCC cluster is something you can finish in about a minute. Here is a video where I show you what it takes to deploy HPCC and scale it up to two nodes on the EC2 cloud using Ensemble.

If you can’t see the embedded video, here is a direct link:

So, what are your impressions? Leave me a comment and let me know.

5 Responses to “Crunching BigData with HPCC and Ensemble”

  1. Juan Negron says:

    Great job on this kim0! Very nice!
    I have a more technical discussion about the hpcc formula for those interested here:


  2. kim0 says:

    If you’re interested in more details, check out this great article by the formula’s author.

  3. Trish McCall says:

    Thank you for sharing this video! If you haven’t already, I invite you to join our community and visit our forums.

  4. kim0 says:

    Thanks Benjamin! I think I will be doing that.