<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Ubuntu Cloud Portal &#187; featured</title>
	<atom:link href="http://cloud.ubuntu.com/tag/featured/feed/" rel="self" type="application/rss+xml" />
	<link>http://cloud.ubuntu.com</link>
	<description>Ubuntu Cloud Portal</description>
	<lastBuildDate>Wed, 15 Feb 2012 22:48:23 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>juju Charm School Webinar, March 8th</title>
		<link>http://cloud.ubuntu.com/2012/02/juju-charm-school-webinar-march-8th/</link>
		<comments>http://cloud.ubuntu.com/2012/02/juju-charm-school-webinar-march-8th/#comments</comments>
		<pubDate>Wed, 15 Feb 2012 22:48:23 +0000</pubDate>
		<dc:creator>jorge</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[planet]]></category>

		<guid isPermaLink="false">http://cloud.ubuntu.com/?p=32031</guid>
		<description><![CDATA[Please join us on 8th March at 5.00pm GMT for a live juju Charm School webinar: To sign up click here: Juju charm school webinar. This charm school will cover how to create charms and use them with juju, and how to submit charms to the larger server community so that your software is easily [...]]]></description>
			<content:encoded><![CDATA[<p>Please join us on 8th March at 5.00pm GMT for a live juju Charm School webinar:</p>
<p>To sign up click here: <a href="http://www.brighttalk.com/webcast/6793/41933">Juju charm school webinar</a>. This charm school will cover how to create charms and use them with juju, and how to submit charms to the larger server community so that your software is easily deployable in the cloud. If you are deploying to the cloud or have software that you&#8217;d like to make easily available to Ubuntu Server users then this is the event for you! </p>
<p>Attendees are encouraged to watch the <a href="http://www.brighttalk.com/webcast/6793/39309">first webinar</a> so that we can concentrate on more advanced topics for this Charm School. </p>
<p>Can&#8217;t make it? We&#8217;ve got in person Charm Schools <a href="https://juju.ubuntu.com/Events">throughout the year</a> if you&#8217;re interested in attending, or you can just <a href="https://juju.ubuntu.com/">contact us</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://cloud.ubuntu.com/2012/02/juju-charm-school-webinar-march-8th/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Ubuntu Server Survey 2012</title>
		<link>http://cloud.ubuntu.com/2012/02/ubuntu-server-survey-2012/</link>
		<comments>http://cloud.ubuntu.com/2012/02/ubuntu-server-survey-2012/#comments</comments>
		<pubDate>Tue, 14 Feb 2012 10:18:12 +0000</pubDate>
		<dc:creator>Gerry Carr</dc:creator>
				<category><![CDATA[cloud]]></category>
		<category><![CDATA[engineering]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[PlanetUbuntu]]></category>
		<category><![CDATA[server]]></category>
		<category><![CDATA[services]]></category>

		<guid isPermaLink="false">http://blog.canonical.com/?p=1242</guid>
		<description><![CDATA[The Ubuntu Server Survey is finally ready to be published it makes for a fascinating read. It is the third survey of its kind and again it has been an overwhelming response with over 6,000 completed surveys throughout 2011 and a heartfelt thanks to all who took the time to complete the comprehensive survey. The [...]]]></description>
			<content:encoded><![CDATA[<p><a title="Ubuntu Server Survey 2012" href="http://www.canonical.com/sites/www.canonical.com/files/active/images/server_survey_2012.pdf">The Ubuntu Server Survey</a> is finally ready to be published it makes for a fascinating read. It is the third survey of its kind and again it has been an overwhelming response with over 6,000 completed surveys throughout 2011 and a heartfelt thanks to all who took the time to complete the comprehensive survey.</p>
<p>The overwhelming impression is the widespread use of Ubuntu both geographically as you might expect with respondents from across the globe. but also in the broad range of workloads in which Ubuntu Server finds itself used. Every category from web and data servers to cloud shows up strongly albeit with a strong bias towards traditional workloads.</p>
<p><a href="http://blog.canonical.com/wp-content/uploads//2012/02/Ss_2012_Importance-of-features.jpg"><img class="aligncenter size-medium wp-image-1243" title="importance of features" src="http://blog.canonical.com/wp-content/uploads//2012/02/Ss_2012_Importance-of-features-268x300.jpg" alt="" width="268" height="300" /></a></p>
<p>As we approach an LTS, again we see evidence of the popularity of the extended support releases. Given we have run this survey three times now over the past three years now we begin to see strong evidence of the switching from one LTS to the next, particularly as the deployment platform, so our user base is certainly staying with us as as we introduce new features and support them in the long term.</p>
<p><a href="http://blog.canonical.com/wp-content/uploads//2012/02/Ss_2012_Server-version-used-11.jpg"><img class="aligncenter size-medium wp-image-1244" title="Ubuntu Server version used 2011" src="http://blog.canonical.com/wp-content/uploads//2012/02/Ss_2012_Server-version-used-11-233x300.jpg" alt="" width="233" height="300" /></a></p>
<p>Virtualization and cloud are now key elements of Ubuntu use, and for the first time we see KVM overtake Xen as the preferred virtualization technology for Ubuntu users, significant as the platform was the first to make the switch to supporting KVM as the native technology. With that though, VMWare remains the most cited virtualization technology showing a healthy mixture of open source and other technologies at use in the Ubuntu user base.</p>
<p><a href="http://blog.canonical.com/wp-content/uploads//2012/02/Ss_2012_Virtualisation-choices.jpg"><img class="aligncenter size-medium wp-image-1245" title="Ss_2012_Virtualisation choices" src="http://blog.canonical.com/wp-content/uploads//2012/02/Ss_2012_Virtualisation-choices-300x291.jpg" alt="" width="300" height="291" /></a></p>
<p>The respondents consideration of cloud makes for interesting reading too. There is significant interest but the use of Ubuntu Server on bare metal remains the primary use case for most users today. There is strong recognition though of the emergence of this powerful technology and with the plans for ease of installation and orchestration in 12.04 LTS it will be interesting to see how this moves the dial in regards to uptake in the Ubuntu base. A deeper analysis  shows a bias towards larger companies (i.e. respondents with more servers) using cloud technologies which is to be expected and overwhelmingly there is recognition of the suitability of Ubuntu Cloud as a basis for those efforts.</p>
<p><a title="Ubuntu Server Survey 2012" href="http://tinyurl.com/7pecwmc">Enjoy the full report</a>, it would be very interesting to hear your comments.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://cloud.ubuntu.com/2012/02/ubuntu-server-survey-2012/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Which is less expensive: Amazon or self-hosted?</title>
		<link>http://cloud.ubuntu.com/2012/02/which-is-less-expensive-amazon-or-self-hosted/</link>
		<comments>http://cloud.ubuntu.com/2012/02/which-is-less-expensive-amazon-or-self-hosted/#comments</comments>
		<pubDate>Sat, 11 Feb 2012 17:00:11 +0000</pubDate>
		<dc:creator>Charlie Oppenheimer, Matrix Partners</dc:creator>
				<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[technologyinternet]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[web services]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=483678</guid>
		<description><![CDATA[Charlie Oppenheimer may be a fan of Amazon Web Services. But, as he explains here, he's long felt that the economics of the choice between self-hosted and cloud provider had more texture to it than the patently attractive sounding “10 cents an hour."<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#38;blog=14960843&#38;post=483678&#38;subd=gigaom2&#38;ref=&#38;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><strong>Updated. </strong><a href="http://aws.amazon.com/">Amazon Web Services</a> (AWS), as the trailblazing provider of Infrastructure as a Service (IaaS), has changed the dialog about computing infrastructure. Today, instead of simply assuming that you’ll be buying and operating your own servers, storage and networking, AWS is always an option to consider, and for many new businesses, it’s simply the default choice.</p>
<p>I’m a huge fan of cloud computing in general and AWS in particular. But I’ve long had an instinct that the economics of the choice between self-hosted and cloud provider had more texture to it than the patently attractive sounding “10 cents an hour,” particularly as a function of demand distribution. As a case in point, Zynga has made it known that for economic reasons, they now use their own infrastructure for baseline loads and use Amazon for peaks and variable loads surrounding new game introductions.</p>
<h2>An analysis of the load profiles</h2>
<p>To tease out a more nuanced view of the economics, I’ve built a detailed Excel model that analyzes the relative costs and sensitivities of AWS versus self-hosted in the context of different load profiles. By “load profiles,” I mean the distribution of demand over the day/month as well as relative needs for bandwidth versus compute resources. The load profile is the key factor influencing the economic choice because it determines what resources are required and how heavily these resources are utilized.</p>
<p>The model provides a simple way to analyze various load profiles and allows one to skew the load between bandwidth-heavy, compute-heavy or any combination. In addition, the model presents the cost of operating 100 percent on AWS, 100 percent self-hosted as well as all hybrid mixes in between.</p>
<p>In a subsequent post, I will share the model and describe how you can use it for scenarios of interest to you. But for this post, I will outline some of the conclusions that I’ve derived from looking at many different scenarios. In most cases, the analysis illustrates why intuition is right (for example, that a highly variable compute load is a slam dunk for AWS). In other cases, certain high-sensitivity factors become evident and drive the economic answer. There are also cases where a hybrid infrastructure is at least worthy of consideration.</p>
<p><a href="http://gigaom.com/2012/02/11/which-is-less-expensive-amazon-or-self-hosted/oppenheimer-graphic1-2/" rel="attachment wp-att-483686"><img  title="Oppenheimer graphic1" src="http://gigaom2.files.wordpress.com/2012/02/oppenheimer-graphic11.jpg?w=604&#038;h=335" alt="" width="604" height="335" class="alignleft size-large wp-image-483686" /></a></p>
<p>To frame an example analysis, here is the daily distribution of a typical Internet application. In the model, traffic distribution is an input from which bandwidth requirements are computed. The distribution over the day reflects the behavior of the user base (in this case, one with a high U.S. business-hour activity peak). Computing load is assumed to follow traffic according to a linear relationship, i.e. higher traffic implies higher compute load.</p>
<p>Note that while labor costs are included in the model, I am leaving them out of this example for simplicity. Because labor is a mostly fixed cost for each alternative, it will tend not to impact the relative comparison of the two alternatives. Rather, it will impact where the actual break-even point lies. If you use the model to examine your own situation, then of course I would recommend including the labor costs on each side.</p>
<p><a href="http://gigaom.com/2012/02/11/which-is-less-expensive-amazon-or-self-hosted/oppenheimer-graphic2/" rel="attachment wp-att-483689"><img  title="Oppenheimer graphic2" src="http://gigaom2.files.wordpress.com/2012/02/oppenheimer-graphic2.jpg?w=604&#038;h=291" alt="" width="604" height="291" class="alignleft size-large wp-image-483689" /></a>For this example, to compute costs for Amazon, I have assumed Standard Extra Large instances and ELB load balancer for the Northern California region. The model computes the number of instances required for each hour of the day. Whenever the economics dictate it, the model applies as many AWS Reserved Instances (capacity contracts with lower variable costs) as justified and fills in with on-demand instances as required. Charges for data are computed according to the progressive pricing schedule that Amazon publishes. To compute costs for self-hosting, I assume co-location with the peak number of Std-XL-equivalent servers required, each loaded to no more than 80 percent of capacity. The costs of hardware are amortized over 36 months. Power is assumed to be included with rackspace fees. Bandwidth is assumed to be obtained on a 95th percentile price basis.</p>
<p>Now let’s look at a sensitivity analysis. Notice in the above example, that a bit more than half of the total cost for each alternative is for bandwidth/data transfer charges ($35,144 for self-hosted at $8/Mbps and $36,900 for AWS). This is important because while Amazon pricing is fixed and published, 95th percentile pricing is highly variable and competitive</p>
<p><a href="http://gigaom.com/2012/02/11/which-is-less-expensive-amazon-or-self-hosted/oppenheimer-graphic3-2/" rel="attachment wp-att-483699"><img  title="Oppenheimer graphic3" src="http://gigaom2.files.wordpress.com/2012/02/oppenheimer-graphic31.jpg?w=604&#038;h=398" alt="" width="604" height="398" class="alignleft size-large wp-image-483699" /></a></p>
<p>The chart above shows total costs as a function of co-location bandwidth pricing. AWS costs are independent of this and thus flat. What this chart shows is that self-hosting costs less for any bandwidth pricing under about $9.50 per Mbps/Month. And if you can negotiate a price as low as $4, you’d be saving more than 40 percent to self-host. I’ll leave discussion of the hybrid to another post.</p>
<p><a href="http://gigaom.com/2012/02/11/which-is-less-expensive-amazon-or-self-hosted/oppenheimer-graphic4/" rel="attachment wp-att-483691"><img  title="Oppenheimer graphic4" src="http://gigaom2.files.wordpress.com/2012/02/oppenheimer-graphic4.jpg?w=604&#038;h=306" alt="" width="604" height="306" class="alignleft size-large wp-image-483691" /></a>This should provide a bit of a feel for how I’ve been conducting these analyses. Above is a visual summary of how different scenarios tend to shake out. The intuitive conclusion that the more spiky the load, the better the economics of the AWS on-demand solution is confirmed. And similarly, the flatter or less variable the load distribution, the more self-hosting appears to make sense. And if you’ve got a situation that uses a lot of bandwidth, you need to look more closely at potential self-hosted savings that could be feasible with negotiated bandwidth reductions.</p>
<p><strong>Update (Feb. 14): </strong>This post has garnered a lot of much appreciated attention. From the comments, I see that two clarifications would be helpful:</p>
<ol>
<li>The key point here is that a comparison of the cost of cloud hosting versus self-hosting needs to be based on the profile of your load. It is <span style="text-decoration: underline;">not</span> that Amazon (or any other provider) is more expensive than self-hosting, as this is often not the case. Rather, it depends on the profile of your load. Moreover, it’s not so important where exactly your breakeven point is but rather it is most important to know the main sensitivities (e.g. bandwidth cost, CPU load, storage, etc.) for your situation so that you can understand which differences could flip the decision. <em>The results here are <span style="text-decoration: underline;">for this example only</span> and other examples will produce different results, some in favor of cloud and some in favor of self-hosting.</em></li>
<li>The specific use case I’ve chosen is for a business that’s pretty far along. But some people have been wondering how this example applies to startups. That’s a great question.</li>
</ol>
<p>While I’ve referred to “spiky” loads, there’s another way to say that which is “variable,” “unknown” or “unpredictable,” which describes the situation that a startup (or other new business endeavor) usually finds itself in. In those cases, the fact that you cannot forecast very well is a reason why it’s highly unlikely you’ll save money by self-hosting…because you’re very unlikely to buy the right amount of capacity. You’ll either overprovision and waste money on unused capacity, or you’ll buy too little and compromise the business. So while you might not call your startup load “spiky,” the fact that it’s unpredictable gives it a similar profile in the model and hence the economic conclusion would tell you to go with the cloud infrastructure route.</p>
<p>Another not-strictly-economic respect that needs to be considered for startups (and others) is the benefit of focusing one’s attention on primary value-creating activities versus commodity activities (relative to the business) that one might not be very good at anyway. In addition, AWS and other cloud providers give us the highly valuable ability to experiment with little downside. This is especially important for the highly iterative and trial-and-error nature of building successful Internet businesses.</p>
<p>The point of this particular example is that if you have a significant amount of load that is well known and predictable then you may be able to save some money by bringing a portion or all of that inside.</p>
<p><em>Charlie Oppenheimer is a serial-CEO and currently an executive-in-residence at venture-capital firm </em><a href="http://matrixpartners.com/"><em>Matrix Partners</em></a><em>. His most recent company, Digital Fountain, was acquired by Qualcomm, and his previous company, Aptivia, was acquired by Yahoo. He blogs at </em><a href="http://stratamotion.com"><em>stratamotion.com</em></a><em>. </em></p>
<p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=483678+which-is-less-expensive-amazon-or-self-hosted&utm_content=aprilkilcrease">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/07/newnet-q2-google-closes-the-quarter-with-a-bang/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=483678+which-is-less-expensive-amazon-or-self-hosted&utm_content=aprilkilcrease">NewNet Q2: Google closes the quarter with a&nbsp;bang</a></li><li><a href="http://pro.gigaom.com/2011/06/from-car-to-cloud-the-future-of-the-in-vehicle-app-landscape/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=483678+which-is-less-expensive-amazon-or-self-hosted&utm_content=aprilkilcrease">From car to cloud: the future of the in-vehicle app&nbsp;landscape</a></li><li><a href="http://pro.gigaom.com/2011/04/finding-the-value-in-social-media-data/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=483678+which-is-less-expensive-amazon-or-self-hosted&utm_content=aprilkilcrease">Finding the Value in Social Media&nbsp;Data</a></li></ul><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&amp;blog=14960843&amp;post=483678&amp;subd=gigaom2&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://cloud.ubuntu.com/2012/02/which-is-less-expensive-amazon-or-self-hosted/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
<enclosure url="http://gigaom2.files.wordpress.com/2012/02/oppenheimer-graphic4.jpg?w=604" length="" type="" />
<enclosure url="http://gigaom2.files.wordpress.com/2012/02/oppenheimer-graphic31.jpg?w=604" length="" type="" />
<enclosure url="http://gigaom2.files.wordpress.com/2012/02/oppenheimer-graphic2.jpg?w=604" length="" type="" />
<enclosure url="http://1.gravatar.com/avatar/f61183cf1974afda4981596f4a1e7cde?s=96&amp;amp;d=retro&amp;amp;r=PG" length="" type="" />
<enclosure url="http://gigaom2.files.wordpress.com/2012/02/oppenheimer-graphic11.jpg?w=604" length="" type="" />
<enclosure url="http://gigaom2.files.wordpress.com/2012/02/oppenheimer-graphic41.jpg?w=210" length="" type="" />
		</item>
		<item>
		<title>Open Cloud Initiative Is Dead Long Live OCI</title>
		<link>http://cloud.ubuntu.com/2012/02/open-cloud-initiative-is-dead-long-live-oci/</link>
		<comments>http://cloud.ubuntu.com/2012/02/open-cloud-initiative-is-dead-long-live-oci/#comments</comments>
		<pubDate>Thu, 09 Feb 2012 21:54:44 +0000</pubDate>
		<dc:creator>Krishnan Subramanian</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[debate]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[Featured Posts]]></category>
		<category><![CDATA[insights]]></category>
		<category><![CDATA[open cloud]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[Trends & Concepts]]></category>

		<guid isPermaLink="false">http://www.cloudave.com/?p=17127</guid>
		<description><![CDATA[First let me make it clear that Open Cloud Initiative (OCI) is not dead and it is going to stay for a long time advocating openness. Also, I will fight all I can to keep it going. Having said that I am writing this post to re-emphasize something which I have been saying all along. [...]]]></description>
			<content:encoded><![CDATA[<a href="http://www.cloudave.com/wordpress/wp-content/uploads/2012/02/opencloud.jpg?adaf63"><img class="alignright  wp-image-17129" title="opencloud" src="http://www.cloudave.com/wordpress/wp-content/uploads/2012/02/opencloud-100x100.jpg?adaf63" alt="" width="160" height="160" /></a>First let me make it clear that Open Cloud Initiative (OCI) is not dead and it is going to stay for a long time advocating openness. Also, I will fight all I can to keep it going. Having said that I am writing this post to re-emphasize something which I have been saying all along. I am also going to use this post as a reference whenever a myth is promoted in the social media circles on this topic. Unlike the cathedrals of proprietary vendors, all the debates about open source and other open topics occur in the open (pun intended). In the typical open source spirit, I will vent my thoughts (once again) on this topic here. For beginners, I have already written about this in my <a href="http://www.cloudave.com/14011/oscon-week-open-cloud-initiative-launched-to-drive-open-standards-in-cloud-computing/">introductory post on OCI</a>.
<blockquote>On the other hand, I am neutral because open source is included as an afterthought in the requirements. There are two schools of thoughts among those who advocate openness in the cloud world. One school, spearheaded by Tim O’Reilly, emphasizes on open protocols, open formats, open architecture, etc. as the necessary conditions for openness. They claim that licensing is irrelevant in the cloud services world. The other school, slightly old fashioned and in minority, claim that open source is equally important in ensuring the openness in the cloud based world. I belong to the second group and I have argued in favor of the importance of open source in the cloud world here and in other fora. For me, open source becomes a requirement because it is the only way we can have a more federated interoperable cloud ecosystem. In the absence of open source, the barriers for participation becomes very high and we may face the prospect of monopoly of cloud providers offering services.</blockquote>
I also highlighted the same thing in a talk at a Cloud Bootcamp at Santa Clara in the sidelines of Cloud Expo and my slides from the talk can be found <a href="http://www.slideshare.net/krishnan/the-importance-of-open-source-in-cloud-computing">here</a>.

<strong>Argument:</strong> When you move from software to services, open source doesn’t matter and only open standards matter

<strong>My counterargument:</strong> I do agree that open standards (open protocols, open formats, etc.) are the key to eliminate cloud lock-in. The biggest concern against large scale cloud adoption is the risk of getting locked into proprietary clouds. Open standards are key to avoid such a lock-in. There is no doubt about it and it is extremely important that we raise the awareness about open standards so that cloud users are protected. However, dismissing open source as irrelevant is shortsighted at the best. Yes, open standards might help users from getting locked into a single vendor but, in the absence of open source, they will be locked into handful of vendors. We saw what happened when only a handful of players meet the needs of an entire country with US wireless industry. They stymied innovation for a long term because they were hell bent upon protecting their existing cash cow than really letting their services to be used for further innovation. Open Source doesn’t guarantee innovation in the technology field but it lowers the barriers so much that it opens up opportunity for others to get into the market, innovate and, more importantly, ensure that the end users are not taken for a ride. Imagine if we would have seen the cloud as AWS introduced to the world in the absence of open source licenses? Do you think Microsoft would have been flexible with their licenses to let Amazon develop a service that will eventually come back to bite them? Open source is critical for cloud computing and it is now helping, in the form of OpenStack, CloudFoundry and others, to ensure that there are not handful of cloud providers who could eventually grow their market power to stymie innovation like the US wireless companies. I strongly believe in demanding open standards but it is quite possible to work around its absence if there is open source, a fact once again <a href="http://bradhedlund.com/2012/02/08/dodging-open-protocols-with-open-software/">highlighted by this brilliant post by Brad Hedlund</a>. No, I am nowhere close to claiming that we don’t need to focus on open standards but I am only arguing that ignoring open source and focussing only on open standards is <strong>s h o r t s i g h t e d</strong>. Period.

<strong>Argument:</strong> Why would a consumer of a service need its source code?

<strong>My counterargument:</strong> The biggest problem with opponents and some proponents of open source is that they really don’t get it. Open source is not about consumption but about its power of enablement. Whether it is the case of software or service, it is the same. Even in the software world, every single user of open source software didn’t take the source code and look at it. Only a small percentage of users who wanted to modify the source code to scratch their itch really used the code. It is clearly the case of enablement than consumption in the software world and it is going to be the same in the services world. Consumers of services are going to give a damn about source code much like the consumers of software but the availability of code is going to enable many providers to scratch the itch and offer services to <strong>meet the more diverse needs</strong> not addressed by the original set of service providers. My point is: it doesn’t matter what we are talking is software or service, open source is an enabler of openness (and innovation) and, therefore, it is equally critical as open standards.

Open standards is about not getting your data locked in but open source is needed if you want to enable the users to run their workloads after that. What is the point in having my data out of a provider if I don’t have the resources available (at a cost affordable to me) to have applications that can act on that data? A truly open cloud should allow me to not just take my data out but also give me <strong>opportunities</strong> to use the data elsewhere without being held hostage by any group (of providers). If the definition of open cloud doesn’t give me this opportunity, then it is meaningless as far as I am concerned.

<strong>Argument:</strong> But, hey, we demand that at least one implementation should be open source

<strong>My counterargument:</strong> This afterthought addition of open source in the open cloud definition is what frustrates me the most. I really really couldn’t get this argument. Why would a proprietary cloud vendor spend critical resources (including tons of money) implementing an open source implementation just to get certified as open cloud by OCI? If market pressures forced the vendor to support open protocols, they will just enable that and satisfy the needs of their market. If the market pressure doesn’t exist, they would not give a damn to open source or open standards anyhow. Microsoft is a good example of market pressures forcing them to open up than some certification agency. Instead if OCI puts open source at the center, along with open standards, for the very definition of open clouds, it will at least motivate the large open source cloud ecosystem (it is growing by leaps and bounds every day) to get certified by OCI. Believe me, I have spoken to at least 5 service providers and platform vendors on this open source cloud ecosystem and they just don’t care about OCI for the very reasons I have highlighted above. They feel that they need not get OCI certified to be seen as a player embracing openness. I am pretty sure this is the thinking with many others in that ecosystem.

<strong>Argument:</strong> What OCI has is the middle ground that will help bring proprietary cloud vendors on board

<strong>My counterargument:</strong> As I told above, what is the incentive for them to come to OCI? If a company believes in the proprietary approach (believe me, it is not a wrong approach at all and what matters is that customers should have choices and proprietary software is one such choice), why would they even worry about openness unless there is market pressure? When there is market pressure they will anyhow adopt open standards and meet the needs. They really don’t give a damn about embracing openness mantra through OCI certification. However, this approach of OCI is a big put off for companies which have openness at the heart and have open source at the core of their clouds. In today’s world, it is a big part of the cloud ecosystem and they feel OCI is not needed to showcase their openness because they have open source in their DNA. OCI can create the market pressure needed to force proprietary cloud vendors to embrace open standards ONLY if they could convince these open source cloud vendors to come on board in large numbers. Why am I not hearing any excitement about OCI in the OpenStack community? The only group that will really benefit from this “middle ground” are those proprietary vendors who are lagging behind in the marketplace but want to use openness mantra to catch up. Yes, the biggest benefactors will be those who want to open wash.

If OCI’s intention is to put pressure on proprietary cloud providers to open up, they are doing it all wrong because whatever they are doing with this so called “middle approach” is not going to add the necessary market pressure. Rather, it has the danger of making OCI irrelevant as more and more open source providers jump in and create the market pressure on their own. I really want OCI to succeed but my efforts to make them see the larger picture is not making any dent. This blog post is my attempt to get the larger community put pressure on OCI to really open up.
<p style="text-align: center;"><strong>Tear down that wall Mr. Johnston!!</strong></p>]]></content:encoded>
			<wfw:commentRss>http://cloud.ubuntu.com/2012/02/open-cloud-initiative-is-dead-long-live-oci/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Automating Openstack Testing on Ubuntu</title>
		<link>http://cloud.ubuntu.com/2012/02/automating-openstack-testing-on-ubuntu/</link>
		<comments>http://cloud.ubuntu.com/2012/02/automating-openstack-testing-on-ubuntu/#comments</comments>
		<pubDate>Wed, 08 Feb 2012 10:11:55 +0000</pubDate>
		<dc:creator>JavaCruft</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[juju]]></category>
		<category><![CDATA[openstack]]></category>
		<category><![CDATA[Q+A]]></category>
		<category><![CDATA[ubuntu]]></category>

		<guid isPermaLink="false">http://javacruft.wordpress.com/?p=286</guid>
		<description><![CDATA[During the Ubuntu precise development cycle the Canonical Platform Server Team have been working on automating testing of Openstack on Ubuntu. The scope of this work was: Per-commit testing of Openstack trunk to evaluate the current state of the upstream codebase in-conjunction with the current packaging in Ubuntu precise and the current Juju charms to [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=javacruft.wordpress.com&#38;blog=16060086&#38;post=286&#38;subd=javacruft&#38;ref=&#38;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[During the Ubuntu precise development cycle the Canonical Platform Server Team have been working on automating testing of Openstack on Ubuntu.

The scope of this work was:
<ol>
	<li>Per-commit testing of Openstack trunk to evaluate the current state of the upstream codebase in-conjunction with the current packaging in Ubuntu precise and the current Juju charms to deploy Openstack.</li>
	<li>SRU testing for Openstack Diablo on Ubuntu 11.10.</li>
</ol>
Openstack do a lot of pre-commit testing through the use of <a href="http://review.openstack.org">gerrit</a> with <a href="http://jenkins.openstack.org">Jenkins</a>; we wanted to supplement this with Ubuntu focused testing to provide another dimension to the testing already completed upstream.

So grab a coffee and make yourself comfortable; this is not a short read….

<strong>Lab Setup</strong>

The Ubuntu Openstack QA lab consists of 12 servers; the primary server in the solution is an Ubuntu 11.10 install providing the following functions:
<ol>
	<li><a href="http://juju.ubuntu.com">Juju</a> – used to deploy Openstack charms in the Lab</li>
	<li>Cobbler to support server provisioning (using the Ubuntu Orchestra packages in Oneiric)</li>
	<li>Jenkins CI – provides triggering based on upstream commits to github repositories and general job control and reporting.</li>
	<li>Schroots for Oneiric and Precise for building packages locally</li>
	<li>A reprepro managed local archive for Oneiric and Precise</li>
	<li>Squid based archive caching to reduce installation times in the lab</li>
</ol>
This server also acts at the gateway into and out of the Lab (it’s setup as a NAT router).

The other 11 servers are registered in Cobbler; All servers are connected to a Sentry CDU (Cabinet Distribution Unit) which allows full power control from Cobbler – thanks goes to Andres Rodriguez for developing the required fence component for Cobbler to support this type of CDU.

<strong>Preseeded LVM Snapshot Installs</strong>

To initiate a new integration test run requires all machines to be powered down and re-provisioned from scratch.  It is essential that our deployment and test runs can cope the frequency of upstream commits, particularly as the frequency increases as Openstack approaches milestones and releases.   After getting the initial lab setup in place, we were able to tear down all machines, re-provision and deploy Openstack in ~30mins.

It was important that we are able to minimize the time taken to complete the testing cycle.   To do so, we’ve employed the use of LVM snapshotting and restoration of the root partition during the the netboot installation.   The process is as follows:
<ol>
	<li>Test run begins</li>
	<li>Juju deploys a service (i.e. nova-compute)</li>
	<li>A machine is netbooted and a preseeded LVM-based Ubuntu installation takes place onto /dev/qalab/root</li>
	<li>At the end of the installation, the root filesystem is moved to /dev/qalab/pristine-[release]-root and a snapshot created at /dev/qalab/root</li>
	<li>The machine reboots, runs Juju and deploys nova-compute as pat of the rest of the Openstack deployment. This deployment is smoke tested.</li>
	<li>The next test run begins.  All machines are terminated. Juju redeploys nova-compute, a machine is netbooted and Ubuntu installation kicks off.</li>
	<li>The installation checks for the existence of a logical volume at /dev/qalab/pristine-[release]-root.  If it exists, it creates a new snapshot at /dev/qalab/root and reboots. If it does not, continues with installation and goto step 4.</li>
	<li>System reboots, Juju installs and redeploys nova-compute to a fresh Ubuntu installation.</li>
</ol>
This process takes place on all nodes in parallel.  With it in place, we were able to cut down the time it took to tear-down and re-provision a node from ~30 minutes to 10 to 15 minutes depending on the service being deployed.

By taking this approach we are also minimize the chance of any nodes hitting an archive inconsistency during installation. This is a known issue when deploying the development release and halts installation on any node that hits it, failing the entire deployment.

All of this is embedded in debian-installer preseeds via Cobbler snippets.  The snippets and kick starts are available at lp:~openstack-ubuntu-testing/+junk/cobbler-lvm-snapshot.

In the future, we’ll be investigating the use of kexec as an alternative to reboot after snapshot restoration to reduce the time spent waiting on servers to boot.  This should minimize the test cycle even more. Credit to James Blair for the idea (see <a href="http://amo-probos.org/post/11)">http://amo-probos.org/post/11</a><a href="http://amo-probos.org/post/11)">)</a>.

<strong>Management of Jenkins</strong>

All of the projects in Jenkins are managed using Jinja2 XML templates in-conjunction with python-jenkins (<a href="http://launchpad.net/python-jenkins">python-jenkins</a>); this makes it really easy to setup new jobs in the lab and reconfigure existing ones as required (as well as providing great backup!).

Templates and management scripts can be found in lp:~openstack-ubuntu-testing/+junk/jenkins-qa-lab

<strong>Testing Openstack Essex on Ubuntu Precise</strong>

This testing was the first to be setup in the lab.  Jenkins (using the git plugin) monitors the upstream github.com repositories for commits on the master branch.  When a change is detected the following process is triggered:

<strong>Build</strong>

Objective: Validate that upstream trunk still builds OK with current packaging for Ubuntu.
<ol>
	<li>A new snapshot upstream tarball is generated based on the latests commit to the upstream component.</li>
	<li>The latest archive packaging for the component is pulled in from lp:~ubuntu-server-dev/&lt;COMPONENT&gt;/essex</li>
	<li>Any changes in the testing packaging for the component are merged from lp:~openstack-ubuntu-testing/&lt;COMPONENT&gt;/essex</li>
	<li>New changelog entries are automatically created for the new upstream commits.</li>
	<li>The source package is generated and built in a clean schroot using sbuild locally.</li>
</ol>
On the assumption that the package built OK locally:
<ol>
	<li>The source package is uploaded to the Testing PPA (ppa:openstack-ubuntu-testing/testing)</li>
	<li>The testing packaging branch is push back to lp:~openstack-ubuntu-testing/&lt;COMPONENT&gt;/essex.</li>
	<li>The binary packages from the sbuild are installed into the local reprepro managed archive.</li>
</ol>
This process is managed by a single script (<a href="http://bazaar.launchpad.net/~openstack-ubuntu-testing/+junk/jenkins-scripts/view/head%3A/tarball.sh">tarball.sh</a>); Credit to Chuck Short for pulling together this part of the process based on work from Openstack upstream.

For changes to the nova project the deploy phase is then executed.

<strong>Deploy</strong>

Objective: Validate that packages install, can be configured and reach a know good state prior to execution of testing.

This phase of testing uses Juju with Cobbler to deploy Openstack into the QA lab infrastructure; It utilizes branches of the Openstack charms to support use of a local archive along with a deployer wrapper around Juju written by Adam Gandelman which executes the actual deployment using Juju and monitors for errors.

<a href="http://javacruft.files.wordpress.com/2012/02/running-openstack.jpg"><img class="aligncenter size-medium wp-image-296" title="Openstack Test Deployment" src="http://javacruft.files.wordpress.com/2012/02/running-openstack.jpg?w=300&amp;h=118" alt="" width="300" height="118" /></a>

The deployer is configured to know where to get the right codebase for the Openstack charms, which services to deploy and which relations to setup between services. As you can see from the above diagram this is non-trivial but the charms and Juju do most of the hard work.

Once Openstack is deployed successfully the test phase is then executed.

<strong>Test</strong>

Objective: Validate that the Openstack deployment in the lab actually works!

At this point, we can run any integration tests we wish against the newly deployed cloud.  This testing is able to help us achieve multiple goals:
<ul>
	<li>Early detection of upstream bugs that break Openstack functionality on Ubuntu</li>
	<li>Verification that packaging branches in the development version of Ubuntu are compatible with upstream trunk.</li>
	<li>Using these packages, verification that our Juju charms are deploying a functional Openstack cloud and are up-to-date with any deployment-related configuration changes upstream.</li>
</ul>
At the moment this phase looks like this:
<ol>
	<li>Configure the Openstack deployment (Adams deployer script provides some utility functions for locating specific services in the environment)
<ul>
	<li>Creates network configuration in Nova for the private instance network as well as a pool of public floating IPs.</li>
	<li>Upload an image into the Glance server for use during testing</li>
	<li>Creates EC2 credentials in the Keystone server for use during testing.</li>
</ul>
</li>
	<li>Run the devstack exercise test scripts which ensure basic functionality of the deployment. Currently, this includes:
<ul>
	<li>Basic euca-tools EC2 API for starting and stopping instances</li>
	<li>EC2 AMI bundle uploads</li>
	<li>Floating IP allocation, association and connectivity to instance</li>
	<li>Volume creation and attachment to instance</li>
</ul>
</li>
</ol>
Note: These are the same sets of tests that are currently run against proposed commits to gerrit upstream.

Longer term we aim to use the Openstack Tempest test suite in the lab; Adam is currently working on getting this up and running.

<strong>Reporting</strong>

The Jenkins instance in the QA lab is not publicly accessible; however all jobs run in the lab are published out (using the Jenkins build-publisher plugin) to <a href="https://jenkins.qa.ubuntu.com/view/Precise%20OpenStack%20Testing/">http://jenkins.qa.ubuntu.com</a> so that people can see the current state of the testing packaging in Ubuntu precise.

We are also working on setting up email notifications.

<strong>Success so far</strong>

Juju charms deploy Openstack components in a configuration that is compatible with upstream trunk prior to updates to packaging in Ubuntu.  Previously packages were updated in the archive first while Juju charm updates lagged behind as incompatibilities were uncovered after the fact.

We enabled automated testing 2 days prior to the 3rd Essex milestone release.  We were able to uncover and help fix a handful of bugs upstream before the release, including critical bugs like <a href="http://pad.lv/921784">921784</a>.  In the past, these bugs were typical uncovered after the release (both upstream and in Ubuntu).

Since E3, there have been even more critical bugs uncovered by this testing and fixed upstream, some of which are only applicable to Ubuntu-specific configurations (not tested upstream) and would have been uncovered by users after code hit the Ubuntu archive (See <a href="http://pad.lv/922232">922232</a>).

<strong>Further Plans for the Lab</strong>

Pre-commit  testing of changes to stable branches;  The Ubuntu Server team are  working upstream on maintaining the stable branches of released versions  of OpenStack – this work will validate patches proposed to stable  branches in review.openstack.org against the current version of the  packaging in released versions of Ubuntu.  Initially this will target  Diablo on Ubuntu 11.10 but will also support Essex on Ubuntu 12.04 once  released.  Ideally the testing process will provide feedback on  review.openstack.org to help the stable release team review proposed  patches.

<strong>References</strong>

Jenkins job configurations: lp:~openstack-ubuntu-testing/+junk/jenkins-qa-lab

Scripts supporting the lab: lp:~openstack-ubuntu-testing/+junk/jenkins-scripts

LVM snapshot preseeds and Cobbler snippets: lp:~openstack-ubuntu-testing/+junk/cobbler-lvm-snapshot

All other relevant scripts, charm branches, etc: <a href="https://code.launchpad.net/~openstack-ubuntu-testing/">https://code.launchpad.net/~openstack-ubuntu-testing/</a>

<strong>Credits</strong>

Overall management of delivery and general whip cracking: Dave Walker

Lab installation and base configuration: Pete Graner, Tim Gardner, Brad Figg, James Page

Fence agent for network power control of servers: Andres Rodriguez

Source package creation and build process: Chuck Short and James Page

Deployment testing using Juju: Adam Gandelman

Testing of Openstack: Adam Gandelman

Jenkins packaging, configuration and management: James Page

Gerrit Plugin for pre-commit testing and generally great ideas: Monty Taylor and James Blair

Writing and reviewing this post: Adam Gandelman, Chuck Short and Dave Walker.

<a href="http://feeds.wordpress.com/1.0/gocomments/javacruft.wordpress.com/286/" rel="nofollow"><img src="http://feeds.wordpress.com/1.0/comments/javacruft.wordpress.com/286/" alt="" border="0" /></a> <a href="http://feeds.wordpress.com/1.0/godelicious/javacruft.wordpress.com/286/" rel="nofollow"><img src="http://feeds.wordpress.com/1.0/delicious/javacruft.wordpress.com/286/" alt="" border="0" /></a> <a href="http://feeds.wordpress.com/1.0/gofacebook/javacruft.wordpress.com/286/" rel="nofollow"><img src="http://feeds.wordpress.com/1.0/facebook/javacruft.wordpress.com/286/" alt="" border="0" /></a> <a href="http://feeds.wordpress.com/1.0/gotwitter/javacruft.wordpress.com/286/" rel="nofollow"><img src="http://feeds.wordpress.com/1.0/twitter/javacruft.wordpress.com/286/" alt="" border="0" /></a> <a href="http://feeds.wordpress.com/1.0/gostumble/javacruft.wordpress.com/286/" rel="nofollow"><img src="http://feeds.wordpress.com/1.0/stumble/javacruft.wordpress.com/286/" alt="" border="0" /></a> <a href="http://feeds.wordpress.com/1.0/godigg/javacruft.wordpress.com/286/" rel="nofollow"><img src="http://feeds.wordpress.com/1.0/digg/javacruft.wordpress.com/286/" alt="" border="0" /></a> <a href="http://feeds.wordpress.com/1.0/goreddit/javacruft.wordpress.com/286/" rel="nofollow"><img src="http://feeds.wordpress.com/1.0/reddit/javacruft.wordpress.com/286/" alt="" border="0" /></a> <img src="http://stats.wordpress.com/b.gif?host=javacruft.wordpress.com&amp;blog=16060086&amp;post=286&amp;subd=javacruft&amp;ref=&amp;feed=1" alt="" width="1" height="1" border="0" />]]></content:encoded>
			<wfw:commentRss>http://cloud.ubuntu.com/2012/02/automating-openstack-testing-on-ubuntu/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>What it really means when someone says ‘Hadoop’</title>
		<link>http://cloud.ubuntu.com/2012/02/what-it-really-means-when-someone-says-%e2%80%98hadoop%e2%80%99/</link>
		<comments>http://cloud.ubuntu.com/2012/02/what-it-really-means-when-someone-says-%e2%80%98hadoop%e2%80%99/#comments</comments>
		<pubDate>Mon, 06 Feb 2012 20:12:12 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[Hbase]]></category>
		<category><![CDATA[Hortonworks]]></category>
		<category><![CDATA[ibm]]></category>
		<category><![CDATA[mapreduce]]></category>
		<category><![CDATA[open source]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=481182</guid>
		<description><![CDATA[Hadoop features front and center in the discussion of how to implement a big data strategy, one of the biggest trends in IT. There’s just one problem that keeps cropping up: many people don’t seem to know exactly what it means when somebody says “Hadoop.”<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#38;blog=14960843&#38;post=481182&#38;subd=gigaom2&#38;ref=&#38;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2011/10/hadoop1.jpg"><img title="hadoop" src="http://gigaom2.files.wordpress.com/2011/10/hadoop1.jpg?w=604" alt=""   class="alignleft size-full wp-image-426524"></a>Big data is among the hottest trends in IT right now, and Hadoop stands front and center in the discussion of how to implement a big data strategy. There’s just one problem that keeps cropping up: many people don’t seem to know exactly what it means when somebody says “Hadoop.”</p>
<p>The problem surfaced again Monday in the form of complaints over Forrester’s new report titled <a href="http://www.forrester.com/rb/Research/wave&trade;_enterprise_hadoop_solutions,_q1_2012/q/id/60755/t/2?src=RSS_2&amp;cm_mmc=Forrester-_-RSS-_-Document-_-6">“Enterprise Hadoop Solution, Q1 2012.”</a><em> InformationWeek </em><a href="http://informationweek.com/news/software/info_management/232600283">spoke with a few vendors</a> that didn’t like how their products were assessed, and database industry analyst Curt Monash <a href="http://www.dbms2.com/2012/02/06/comments-on-the-2012-forrester-wave-enterprise-hadoop-solutions">says the report “compares apples, peaches, almonds, and peanuts.”</a> I thought the same thing when I saw a copy of the report last week. They all focus on Hadoop, but Hortonworks is not Datameer is not HStreaming.</p>
<p>Allow me to explain. Hopefully, this provides a foundation for parsing what people talk about when they talk about Hadoop, and for differentiating one type of product from another. (And you can learn even more about Hadoop and how it’s used at our <a href="http://event.gigaom.com/structuredata/?utm_source=cloud&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=481182+what-it-really-means-when-someone-says-hadoop&amp;utm_content=dharrisstructure">Structure: Data</a> conference taking place next month in New York City.)</p>
<h2>What Hadoop is</h2>
<p>I went into this in more detail in a <a href="http://pro.gigaom.com/2011/03/defining-hadoop-the-players-technologies-and-challenges-of-2011/?utm_source=cloud&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=481182+what-it-really-means-when-someone-says-hadoop&amp;utm_content=dharrisstructure">GigaOM Pro report published last March</a> (<strong>sub req’d</strong>), but the long and short is that Hadoop is, at its core, an <a href="http://hadoop.apache.org/">Apache Software Foundation project</a> consisting of two primary subprojects — <a href="http://hadoop.apache.org/mapreduce/">Hadoop MapReduce</a> and the <a href="http://hadoop.apache.org/hdfs/">Hadoop Distributed File System</a>. MapReduce is the parallel-processing engine that allows Hadoop to churn through large data sets in relatively short order. HDFS is the distributed file system that lets Hadoop scale across commodity servers and, importantly, store data on the compute nodes in order to boost performance (and potentially save money). These are the two must-have components for any Hadoop distribution.</p>
<p>There are also a number of Apache projects related to Hadoop, often built atop either Hadoop MapReduce or HDFS. These include — but are not limited to — <a href="http://hive.apache.org/">Hive</a> and <a href="http://pig.apache.org/">Pig</a>, two SQL-like query languages to provide data-warehouse-like capabilities to a Hadoop cluster, and <a href="http://hbase.apache.org/">HBase</a>, a NoSQL database that leverages HDFS as its distributed storage engine.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/02/hadoop-projects.jpg"><img title="hadoop projects" src="http://gigaom2.files.wordpress.com/2012/02/hadoop-projects.jpg?w=604&#038;h=198" alt="" width="604" height="198" class="aligncenter size-large wp-image-481309"></a></p>
<h2>Hadoop distributions</h2>
<p>These are packaged software products that aim to ease deployment and management of Hadoop clusters compared with simply downloading the various Apache code bases and trying to cobble together a system. Presently, <a href="http://gigaom.com/cloud/why-cloudera-isnt-sweating-the-hadoop-competition/">Cloudera</a>, <a href="http://gigaom.com/cloud/yahoo-spinoff-shakes-up-hadoop-market-with-new-distro/">Hortonworks</a>, <a href="http://gigaom.com/cloud/battle-on-mapr-cloudera-pimp-their-version-of-hadoop/">MapR</a> and <a href="http://gigaom.com/cloud/emc-throws-lots-of-hardware-at-hadoop/">EMC</a>  all offer their own Hadoop distributions. Although they’re all unique — sometimes very unique, as with MapR’s proprietary file system — they all package a set of Hadoop projects (MapReduce, Hive, Sqoop, Pig, etc.) in a way that in theory makes them integrate more naturally, and to run both smoothly and securely.</p>
<p>Many Hadoop distributions integrate with various data warehouses, databases and other data-management products, with the goal of moving data between Hadoop clusters and other environments so each might process or query data stored in the other.</p>
<h2>Hadoop management software</h2>
<p>Just as the wording implies, Hadoop management software is designed to make it easier to manage and troubleshoot a Hadoop cluster. Such products are usually sold or offered by companies peddling Hadoop distributions, because even when commercially packaged, Hadoop is still a complex architecture and somewhat foreign to most IT personnel and products. However, third parties such as <a href="http://gigaom.com/cloud/platform-computing-extends-hpc-reach-into-mapreduce/">Platform Computing</a> (now <a href="http://gigaom.com/cloud/ibm-eyes-big-data-at-big-banks-with-platform-buy/">part of IBM</a>) and <a href="http://gigaom.com/cloud/zettaset-raises-3m-for-the-consumerization-of-big-data/">Zettaset</a> also sell software for managing Hadoop clusters, and their products are typically agnostic as to what distributions they support.</p>
<p>But distributions and management software are all about the infrastructure and the platform. Anyone actually wanting to use Hadoop still needs to know how to write applications that leverage the underlying architecture.</p>
<h2>Hadoop application software (or, products that use Hadoop)</h2>
<p>The Hadoop ecosystem gets really complex when we start looking at products that exist to help developers write Hadoop applications or otherwise analyze data stored within Hadoop in a manner other than writing traditional MapReduce jobs. These range from abstraction layers such as <a href="http://karmasphere.com/index.php">Karmasphere Analyst</a> or <a href="http://gigaom.com/cloud/ibms-hadoop-effort-grows-from-project-to-product/">IBM Infosphere BigInsights</a>, to <a href="http://gigaom.com/cloud/hadapt-raises-9-5m-for-hadoop-data-warehouse/">Hadapt</a>, which offers a single-platform product fusing a SQL data warehouse with a Hadoop cluster, to <a href="http://www.hstreaming.com/">HStreaming</a>, which promises real-time processing and analytics.</p>
<p>The one common thing among all these products, however, is that they are not Hadoop distributions, but sit atop platform software from Hortonworks, EMC or whomever. Some products that get thrown into the Hadoop fray, such as <a href="http://outerthought.org/site/products/lily.html">Outerthought Lily</a> or <a href="http://drawntoscale.com/how_it_works.html">Drawn to Scale Spire</a>, are essentially scale-out databases built atop HBase (which itself is a separate project built atop HDFS). The image below, from Karmasphere, gives a particularly clear map of how a Hadoop environment might look.</p>
<p><a href="http://gigaom2.files.wordpress.com/2011/06/hadoopdatafabric-ks.jpeg"><img title="HadoopDataFabric-KS" src="http://gigaom2.files.wordpress.com/2011/06/hadoopdatafabric-ks.jpeg?w=604&#038;h=379" alt="" width="604" height="379" class="aligncenter size-large wp-image-369496"></a></p>
<p>The applications and analytics space is probably <a href="http://gigaom.com/cloud/5-low-profile-startups-that-could-change-the-face-of-big-data/">where we’ll see the biggest influx of new companies</a>, as writing Hadoop applications is still tough, but it’s also how companies will actually start experiencing direct business benefits. In fact, it’s these type of higher-level products that are the focal point of <a href="http://gigaom.com/cloud/accel-forms-100m-fund-to-feed-big-data-apps/">Accel Partners’ new big data fund</a>.</p>
<p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=481182+what-it-really-means-when-someone-says-hadoop&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/03/defining-hadoop-the-players-technologies-and-challenges-of-2011/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=481182+what-it-really-means-when-someone-says-hadoop&utm_content=dharrisstructure">Defining Hadoop: the Players, Technologies and Challenges of&nbsp;2011</a></li><li><a href="http://pro.gigaom.com/2012/01/how-amazons-dynamodb-is-rattling-the-big-data-and-cloud-markets/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=481182+what-it-really-means-when-someone-says-hadoop&utm_content=dharrisstructure">Amazon’s DynamoDB: rattling the cloud&nbsp;market</a></li><li><a href="http://pro.gigaom.com/2011/07/infrastructure-q2-big-data-and-paas-gain-more-momentum/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=481182+what-it-really-means-when-someone-says-hadoop&utm_content=dharrisstructure">Infrastructure Q2: Big data and PaaS gain more&nbsp;momentum</a></li></ul><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&amp;blog=14960843&amp;post=481182&amp;subd=gigaom2&amp;ref=&amp;feed=1" width="1" height="1" /><hr /><p>
	<a href='http://ads.gigaom.com/redirect/rss/'>
		<img 
			src='http://ads.gigaom.com/show/rss/' 
			alt=''
			border='0'
		/>
	</a>
</p>]]></content:encoded>
			<wfw:commentRss>http://cloud.ubuntu.com/2012/02/what-it-really-means-when-someone-says-%e2%80%98hadoop%e2%80%99/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
<enclosure url="http://0.gravatar.com/avatar/a5a578e0c178f533ff6edc2ffad670a1?s=96&amp;amp;d=retro&amp;amp;r=PG" length="" type="" />
<enclosure url="http://gigaom2.files.wordpress.com/2011/10/hadoop1.jpg" length="" type="" />
<enclosure url="http://gigaom2.files.wordpress.com/2012/02/hadoop-projects.jpg?w=604" length="" type="" />
<enclosure url="http://gigaom2.files.wordpress.com/2011/06/hadoopdatafabric-ks.jpeg?w=604" length="" type="" />
<enclosure url="http://gigaom2.files.wordpress.com/2011/10/hadoop-e1319488918182.jpg?w=210" length="" type="" />
		</item>
		<item>
		<title>juju can help your development team speed up iteration</title>
		<link>http://cloud.ubuntu.com/2012/02/juju-can-help-your-development-team-speed-up-iteration/</link>
		<comments>http://cloud.ubuntu.com/2012/02/juju-can-help-your-development-team-speed-up-iteration/#comments</comments>
		<pubDate>Thu, 02 Feb 2012 20:39:02 +0000</pubDate>
		<dc:creator>jorge</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[planet]]></category>

		<guid isPermaLink="false">http://cloud.ubuntu.com/?p=31515</guid>
		<description><![CDATA[In this blog post the Launchpad team uses juju to deploy oops-tool, a Django-based tool that aggregates bug reports for the Launchpad project. We typically talk about services that people commonly deploy, such as Mediawiki or WordPress. However there is another use case for juju that is just as powerful, as a tool to help [...]]]></description>
			<content:encoded><![CDATA[<p>In this <a href="http://blog.launchpad.net/general/how-to-do-juju-%E2%80%93-charming-oops-tools">blog post</a> the Launchpad team uses juju to deploy oops-tool, a Django-based tool that aggregates bug reports for the Launchpad project.</p>
<p>We typically talk about services that people commonly deploy, such as Mediawiki or WordPress. However there is another use case for juju that is just as powerful, as a tool to help iterate on whatever you&#8217;re working on <em>faster</em>. oops-tool is not a general tool that most people will want to use; it&#8217;s very specialized. </p>
<p>However the Launchpad team have encapsulated their service in a charm. Any person can now deploy oops-tool in 4 commands. Now have a think about a project you and your team might be working on and the complexities of that service and how wonderful it would be if any person on any team could deploy any service in your project&#8217;s code base with that kind of ease. You&#8217;re codifying the management of your service so that as you work on a feature branch you can deploy, test, and then iterate. </p>
<p>juju strives to deploy your service in the same way that people strive to have their software build in one set of processes, but it&#8217;s more than just that. Deploy-and-forget is nice, but being able to manage a service over its lifetime is what people need in the cloud and you can do that with a juju charm.</p>
<p>Launchpad has a myriad of services it provides, we&#8217;ll keep you in touch on how that team is using juju to simplify their processes. Got more questions about juju and how we can help you manage in the cloud? Feel free to <a href="https://juju.ubuntu.com/">Contact Us</a> and ask questions!</p>
]]></content:encoded>
			<wfw:commentRss>http://cloud.ubuntu.com/2012/02/juju-can-help-your-development-team-speed-up-iteration/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why Open?</title>
		<link>http://cloud.ubuntu.com/2012/02/why-open/</link>
		<comments>http://cloud.ubuntu.com/2012/02/why-open/#comments</comments>
		<pubDate>Thu, 02 Feb 2012 19:11:00 +0000</pubDate>
		<dc:creator>MontyTaylor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[ubuntucloud]]></category>

		<guid isPermaLink="false">http://h30529.www3.hp.com/t5/HP-Scaling-the-Cloud-Blog/Why-Open/ba-p/305</guid>
		<description><![CDATA[Because Open changes the conversation. Open alters the way the game is played. Open is what customers want.
In the early 1980's IBM trounced Digital, who were far and away the leaders in the world of computing. Instead of focusing on the vendor-specif...]]></description>
			<content:encoded><![CDATA[<img src="http://farm8.staticflickr.com/7153/6789612591_404980712d_m.jpg" alt="" width="203" height="240" align="left" border="0" hspace="12" vspace="8" />Because Open changes the conversation. Open alters the way the game is played. Open is what customers want.

In the early 1980's IBM trounced Digital, who were far and away the leaders in the world of computing. Instead of focusing on the vendor-specific lock-in oriented architecture DEC employed, IBM launched the PC. By the standards of the time, the PC was an astoundingly Open platform. Rivals such as Compaq and HP were not only allowed but even encouraged to make compatible systems. The resulting business environment declawed Digital and left it a set of aging plaques and photos in an HP cafeteria. HP, on the other hand, leads the global manufacturing of PC compatible computers.

Linux repeated the pattern by unseating Solaris, which truly was the dot in dot-com at its height. Linux gave customers a choice. Linux runs on hardware offerings from everyone, not just from one vendor. HP has reaped huge benefits from this by making excellent server products and by ensuring that nothing closed is required to use them.

<a href="http://h30529.www3.hp.com/t5/HP-Scaling-the-Cloud-Blog/Why-Open/ba-p/305">Read more...</a>]]></content:encoded>
			<wfw:commentRss>http://cloud.ubuntu.com/2012/02/why-open/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Hot Topic at SCALE: OpenStack</title>
		<link>http://cloud.ubuntu.com/2012/02/the-hot-topic-at-scale-openstack/</link>
		<comments>http://cloud.ubuntu.com/2012/02/the-hot-topic-at-scale-openstack/#comments</comments>
		<pubDate>Wed, 01 Feb 2012 19:10:03 +0000</pubDate>
		<dc:creator>MargotRudell</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[openstack]]></category>
		<category><![CDATA[ubuntucloud]]></category>

		<guid isPermaLink="false">http://h30529.www3.hp.com/t5/HP-Scaling-the-Cloud-Blog/The-Hot-Topic-at-SCALE-OpenStack/ba-p/299</guid>
		<description><![CDATA[The biggest topic at this year&#8217;s Southern California Linux Expo (SCALE) conference was the OpenStackTM project. Everyone came away from the show appreciating that OpenStack is only going to get more popular and bigger. OpenStack is building momen...]]></description>
			<content:encoded><![CDATA[<img src="http://farm8.staticflickr.com/7014/6802342035_2fca2fc0b4.jpg" alt="" width="364" height="483" align="left" border="0" hspace="12" vspace="8" />The biggest topic at this year’s Southern California Linux Expo (SCALE) conference was the OpenStack<sup>TM</sup> project. Everyone came away from the show appreciating that OpenStack is only going to get more popular and bigger. OpenStack is building momentum. Jim Ash and Andrei Matei from the HP Cloud Services team stayed busy – talking with and signing up people for our private beta (HP Cloud Compute and HP Cloud Object Storage). To the SCALE attendees, who gave us their opinions, HP’s involvement with OpenStack means that OpenStack will be a serious, viable option for businesses of all sizes and for developers – who want a real choice in the market that competes with the existing proprietary cloud options.

People at the conference wanted to know more about the links between HP, OpenStack technology, Linux, and other open source projects. In a nutshell, OpenStack technology is the open source, open API, open development, and open orchestration layer powering HP Cloud Services. And OpenStack technology is built on Linux and open source technology. The OpenStack project and offerings like HP Cloud Services that integrate OpenStack technology bring open source technology and ideals to businesses of all sizes. We were excited about the warm reception HP Cloud Services got from people with a broad range of backgrounds in Linux and cloud – and from developers from all kinds of companies, from the smallest organizations to the largest enterprises.

<a href="http://h30529.www3.hp.com/t5/HP-Scaling-the-Cloud-Blog/The-Hot-Topic-at-SCALE-OpenStack/ba-p/299">Read more…</a>]]></content:encoded>
			<wfw:commentRss>http://cloud.ubuntu.com/2012/02/the-hot-topic-at-scale-openstack/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why 2013 is the year of ‘NoOps’ for programmers [Infographic]</title>
		<link>http://cloud.ubuntu.com/2012/02/why-2013-is-the-year-of-%e2%80%98noops%e2%80%99-for-programmers-infographic/</link>
		<comments>http://cloud.ubuntu.com/2012/02/why-2013-is-the-year-of-%e2%80%98noops%e2%80%99-for-programmers-infographic/#comments</comments>
		<pubDate>Tue, 31 Jan 2012 23:00:57 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[application development]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[Cloud Foundry]]></category>
		<category><![CDATA[developers]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[heroku]]></category>
		<category><![CDATA[infrastructure as a service]]></category>
		<category><![CDATA[Platform as a Service]]></category>
		<category><![CDATA[VMWare]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=478749</guid>
		<description><![CDATA[AppFog CEO Lucas Carlson isn't shy about touting PaaS as the ideal way for developers to access cloud computing resources, but he also knows it's not mainstream. In this inforgraphic illustrating the evolution of cloud computing, Carlson says PaaS will hit its stride in 2013.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#38;blog=14960843&#38;post=478749&#38;subd=gigaom2&#38;ref=&#38;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<div id="attachment_369833" class="wp-caption alignleft" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2011/06/lucas_carlson.jpg"><img  title="lucas_carlson" src="http://gigaom2.files.wordpress.com/2011/06/lucas_carlson.jpg?w=604" alt=""   class="size-full wp-image-369833" /></a><p class="wp-caption-text">Lucas Carlson, CEO AppFog</p></div>
<p><a href="http://appfog.com">AppFog</a> Founder and CEO Lucas Carlson isn&#8217;t shy about touting platform-as-a-service as the ideal way for developers to access cloud computing resources, but he isn&#8217;t blind either. Although PaaS has been around for a couple years now and has already <a href="http://gigaom.com/cloud/salesforce-buys-herokus-ruby-cloud-for-212-million/">spurred hundreds of millions in M&amp;A spending</a>, Carlson knows it&#8217;s nowhere near the mainstream yet.</p>
<p>Carlson lays out his version of the evolution of cloud computing in the infographic below. Right now, API-based infrastructure-as-a-service offerings like that from Amazon Web Services and SysOps (or DevOps) tools are developers&#8217; best friends in the cloud. Application-lifecycle platforms such as Cloud Foundry (the <a href="http://gigaom.com/cloud/cloud-foundry-lets-apps-span-cloud-providers/">VMware-ran open source project</a>  <a href="http://gigaom.com/cloud/cloud-foundry-adds-php-python-appfog-now-a-user/">on which AppFog is built</a>) and <a href="http://gigaom.com/cloud/red-hat-automates-more-java-dev-in-openshift-paas/">Red Hat&#8217;s OpenShift</a>  are poised to reach critical mass in 2012, whereas so-called &#8220;NoOps&#8221; platforms such as AppFog and Heroku will reach that point in 2013.</p>
<p>During a recent phone call, Carlson told me PaaS is the model of the future, not the present, because only about 2 to 4 percent of developers &#8212; the ones on the cutting edge &#8212; are actually using it right now. &#8220;As interesting as PaaS is, the majority of developers … have some very real concerns that are holding them back from actually going forward,&#8221; Carlson said.</p>
<p>Aside from illustrating the evolution of cloud-development tools, Carlson said the infographic also aims to clearly delineate the different layers of the cloud stack, something he <a href="http://blog.appfog.com/atomic-units-for-a-company/">opined on in a December blog post</a>. PaaS isn&#8217;t a feature of IaaS, he explained, but &#8220;a full reinvention from the ground up.&#8221; Every layer has to fully understand the layers below because they must manage them, but the user experience and the resulting increase in developer productivity are what make the service.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/01/appfog_infographic_013012.jpg"><img  title="appfog_infographic_013012" src="http://gigaom2.files.wordpress.com/2012/01/appfog_infographic_013012.jpg?w=604&#038;h=4000" alt="" width="604" height="4000" class="aligncenter size-full wp-image-478763" /></a></p>
<p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=478749+why-2013-is-the-year-of-noops-for-programmers-infographic&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/01/how-amazons-dynamodb-is-rattling-the-big-data-and-cloud-markets/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=478749+why-2013-is-the-year-of-noops-for-programmers-infographic&utm_content=dharrisstructure">Amazon’s DynamoDB: rattling the cloud&nbsp;market</a></li><li><a href="http://pro.gigaom.com/2011/04/infrastructure-q1-iaas-comes-down-to-earth-big-data-takes-flight/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=478749+why-2013-is-the-year-of-noops-for-programmers-infographic&utm_content=dharrisstructure">Infrastructure Q1: IaaS Comes Down to Earth; Big Data Takes&nbsp;Flight</a></li><li><a href="http://pro.gigaom.com/2010/07/infrastructure-overview-q2-2010/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=478749+why-2013-is-the-year-of-noops-for-programmers-infographic&utm_content=dharrisstructure">Infrastructure Overview, Q2&nbsp;2010</a></li></ul><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&amp;blog=14960843&amp;post=478749&amp;subd=gigaom2&amp;ref=&amp;feed=1" width="1" height="1" /><hr /><p>
	<a href='http://ads.gigaom.com/redirect/rss/'>
		<img 
			src='http://ads.gigaom.com/show/rss/' 
			alt=''
			border='0'
		/>
	</a>
</p>]]></content:encoded>
			<wfw:commentRss>http://cloud.ubuntu.com/2012/02/why-2013-is-the-year-of-%e2%80%98noops%e2%80%99-for-programmers-infographic/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
<enclosure url="http://gigaom2.files.wordpress.com/2012/01/appfog_infographic_013012.jpg" length="" type="" />
<enclosure url="http://gigaom2.files.wordpress.com/2011/06/lucas_carlson.jpg" length="" type="" />
<enclosure url="http://0.gravatar.com/avatar/a5a578e0c178f533ff6edc2ffad670a1?s=96&amp;amp;d=retro&amp;amp;r=PG" length="" type="" />
<enclosure url="http://gigaom2.files.wordpress.com/2012/01/appfog_infographic_0130121-e1328040493916.jpg?w=199" length="" type="" />
		</item>
	</channel>
</rss>

