<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>

<channel>
	<title>Permabits and Petabytes</title>
	<atom:link href="http://blog.permabit.com/index.php?feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://blog.permabit.com</link>
	<description>OEM Data Optimization Solutions for Next Generation Storage</description>
	<pubDate>Fri, 03 Sep 2010 00:56:20 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>VMware Performance and Deduplication</title>
		<link>http://blog.permabit.com/index.php/2010/09/vmware-performance-and-deduplication/</link>
		<comments>http://blog.permabit.com/index.php/2010/09/vmware-performance-and-deduplication/#comments</comments>
		<pubDate>Wed, 01 Sep 2010 20:17:55 +0000</pubDate>
		<dc:creator>Louis Imershein</dc:creator>
		
		<category><![CDATA[Louis Imershein, Director PM]]></category>

		<category><![CDATA[Albireo]]></category>

		<category><![CDATA[dedupe]]></category>

		<category><![CDATA[deduplication]]></category>

		<category><![CDATA[enterprise storage]]></category>

		<category><![CDATA[ESXi]]></category>

		<category><![CDATA[primary storage]]></category>

		<category><![CDATA[primary storage deduplication]]></category>

		<category><![CDATA[VDI]]></category>

		<category><![CDATA[virtualization]]></category>

		<category><![CDATA[VMware]]></category>

		<category><![CDATA[vSphere]]></category>

		<guid isPermaLink="false">http://blog.permabit.com/?p=1012</guid>
		<description><![CDATA[Last week I talked about deduplication and its advantages for VMware environments.  One question I heard from a reader, &#8220;if the savings are as great as you&#8217;re saying, why isn&#8217;t deduplication in VMware environments everywhere today?&#8221;  And on the surface, I think the answer is simple, the key challenge to VMware deduplication is performance.   IT [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.permabit.com/index.php/2010/08/the-case-for-vmware-and-deduplication/" >Last week</a> I talked about deduplication and its advantages for VMware environments.  One question I heard from a reader, &#8220;if the savings are as great as you&#8217;re saying, why isn&#8217;t deduplication in VMware environments everywhere today?&#8221;  And on the surface, I think the answer is simple, the key challenge to VMware deduplication is <em>performance</em>.   IT organizations have come to understand how deduplication saves on storage costs, but they are generally concerned about any resulting performance impact.   The good news is that <a href="http://www.permabit.com/albireo/albireo-overview.asp" >Albireo</a> is specifically designed for highly efficient primary storage deduplication.  <span id="more-1012"></span></p>
<p>Albireo&#8217;s unique design provides efficient deduplication without any performance penalty because Albireo runs entirely out of the read path.  Albireo&#8217;s role is to provide deduplication advice to existing storage subsystems. Simply and elegantly, Albireo takes advantage of the built-in capabilities of modern file system and block virtualization layers provided by all mainstream storage vendors.  With this design, Albireo creates no more performance impact than when common features such as clones and snapshots are in use on the storage system.</p>
<p>So what about ingestion speeds?   This is where Albireo shines.   The <a href="http://www.permabit.com/albireo/grid.asp" >grid-based capabilities</a> of the Albireo index allow it to be scaled across multiple independent CPU resources to provide deduplication advice on ingest at breakneck speeds.  While Albireo is rated in excess of 140 MB/sec per processor core on a 3.0 GHz Intel Xeon even when operating with a small 4 KB chunk size, performance is even greater with larger chunk sizes. With 64KB chunks we&#8217;ve seen Albireo Grid-based dedupe scale to support ingest speeds over 5.5GB/s.   These speeds are more than adequate to meet the needs of even the most intensive VMware environments. So there is no performance impact or latency as has been identified in early stage deduplication backup offerings.</p>
<p>What&#8217;s more, Albireo allows storage vendors to give their customers choice.  With Albireo, an OEM can chose <a href="http://www.permabit.com/albireo/processing.asp" >to implement  deduplication</a> inline, in parallel, and/or as a post-process event - providing the greatest flexibility for performance intensive environments while simplifying manageability.  This means that in the latest powerful storage equipment deduplication can be performed inline with no performance impact, while in more resource constrained storage hardware, customers simply schedule deduplication to run at off hours.  Either way, there is no performance impact to the user environment.</p>
<p>Finally, I wanted to talk a little about a way in which deduplication can actually improve read performance.   While Albireo itself doesn&#8217;t sit in the read path, it does provide advice which allows blocks in the read path to be shared.   Like VMware <a href="http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&amp;cmd=displayKC&amp;externalId=1021095" onclick="javascript:pageTracker._trackPageview('/outbound/article/kb.vmware.com');">Transparent Page Sharing</a> (TPS), Albireo looks at new data and provides advice on how to eliminate duplicate blocks using existing capabilities of modern storage systems. These existing capabilities maintain deduplicated blocks in the read cache, reducing the amount of memory needed for caching the operating systems and applications stored in VMware environments.  The result is improved overall performance when reading these shared blocks from disk.   This becomes especially important in virtual desktop deployments, where many users logging in at the same time can result in <a href="http://blog.permabit.com/index.php/2010/07/virtual-desktops-the-bigger-beast/" >boot storms</a>.  In these environments, deduplication can reduce the impact of boot storms by up to 50%.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.permabit.com/index.php/2010/09/vmware-performance-and-deduplication/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Cherry Garcia and Deduplication</title>
		<link>http://blog.permabit.com/index.php/2010/09/cherry-garcia-and-deduplication/</link>
		<comments>http://blog.permabit.com/index.php/2010/09/cherry-garcia-and-deduplication/#comments</comments>
		<pubDate>Wed, 01 Sep 2010 14:49:25 +0000</pubDate>
		<dc:creator>Wayne Salpietro</dc:creator>
		
		<category><![CDATA[Wayne Salpietro, Director PMM]]></category>

		<category><![CDATA[Albireo]]></category>

		<category><![CDATA[capex]]></category>

		<category><![CDATA[data optimization]]></category>

		<category><![CDATA[dedupe]]></category>

		<category><![CDATA[enterprise archive]]></category>

		<category><![CDATA[latency]]></category>

		<category><![CDATA[memory efficiency]]></category>

		<category><![CDATA[performance]]></category>

		<category><![CDATA[primary dedupe]]></category>

		<category><![CDATA[primary storage]]></category>

		<category><![CDATA[primary storage deduplication]]></category>

		<category><![CDATA[scalability]]></category>

		<category><![CDATA[sub-file dedupe]]></category>

		<category><![CDATA[ZFS]]></category>

		<category><![CDATA[ZFS Deduplication]]></category>

		<guid isPermaLink="false">http://blog.permabit.com/?p=992</guid>
		<description><![CDATA[ 
Deduplication (data optimization) is a hot topic these days because its promise to reduce storage consumption and its associated costs is compelling to today&#8217;s high data growth businesses. Impact on IT budgets, CAPEX and OPEX as a result of applying dedupe in primary and/or secondary tiers of storage can be substantial!
Lost in all the [...]]]></description>
			<content:encoded><![CDATA[<p align="center"><strong> </strong></p>
<p>Deduplication (data optimization) is a hot topic these days because its promise to reduce storage consumption and its associated costs is compelling to today&#8217;s high data growth businesses. Impact on IT budgets, CAPEX and OPEX as a result of applying dedupe in primary and/or secondary tiers of storage can be substantial!</p>
<p>Lost in all the hype is the fact that there are as many flavors of dedupe as Ben &amp; Jerry&#8217;s has flavors of ice cream! <span id="more-992"></span>They are all different and provide varied results. There are a few vectors to pivot around in this discussion such as file vs. sub-file, memory efficiency and scalability.   Each are part of the puzzle vendors bring together to help reduce the amount of data and associated storage space consumed. Unfortunately, these puzzle pieces determine the effectiveness of the dedupe offering yielding a high performance solution vs. a minimal capability solution.  Just like a sports car compared to a VW Beetle. Both will provide a service, it&#8217;s a matter of the results!</p>
<p>In each characteristic, the impact and yield really make a difference;</p>
<p>* File      vs. Sub-file - as data is analyzed the smaller the chunk size the higher      the deduplication yield. So deduping at the file level yields the least      dedupe benefit while deduping at the smaller (sub-file) data chunk level (4K      is optimal) yields the largest benefit.</p>
<p>* Memory      efficiency is critical since the more hash keys you can keep in memory the      better. The closer you can function to 100% &#8216;in memory,&#8217; the better the      overall performance will be!</p>
<p>* Not      too far behind memory efficiency is the ability to sample the most amount      of data. Again, the more efficient the better and the higher the scalability      will be.</p>
<p>Here&#8217;s a case in point. Permabit&#8217;s Albireo High Performance Data Optimization solution recently changed the deduplication market as it applies to primary storage data optimization (dedupe) and has received substantial press and analyst recognition.  Contrasting Albireo is the Open ZFS Deduplication technology which has been used as a base for some storage optimization solutions. Let&#8217;s compare the two:</p>
<p>* ZFS      uses a 128K chunk size vs. Albireo using as small as a 4K chunk (it&#8217;s      configurable).  Albireo will find      substantially more duplicates and reduce storage consumption dramatically      compared to ZFS</p>
<p>* ZFS      uses a table entry approach for data identification at 200 bytes each vs.      Albireo using an index with data representation of only 4 bytes each.  Albireo will typically reside in memory      99.5% of the time. ZFS on the other hand, will page out to disk (resulting      in major performance slow down) for table look-up more than 50% of the time      as a result of the 200 byte data id (50 times the size of Albireo).</p>
<p>* ZFS      can scale out to 20TB using 32 GB of RAM vs. Albireo which can scale to 640TB      using only 16GB of RAM (64 times greater scalability with Albireo)!</p>
<p>Albireo will be more effective at finding duplicates, more efficient at processing the data and more scalable.   The contrast is significant because businesses do run on the data that is being deduped and the impact on the overall storage costs will be substantial. The higher performance, the more efficient and the more scalable one flavor of dedupe is over another, the better! Like Cherry Garcia is over all those other flavors, Albireo is to the rest of the dedupe offerings!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.permabit.com/index.php/2010/09/cherry-garcia-and-deduplication/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Interoperability is OK, but Optimization is Optimal</title>
		<link>http://blog.permabit.com/index.php/2010/08/interoperability-is-ok-but-optimization-is-optimal/</link>
		<comments>http://blog.permabit.com/index.php/2010/08/interoperability-is-ok-but-optimization-is-optimal/#comments</comments>
		<pubDate>Tue, 31 Aug 2010 14:26:08 +0000</pubDate>
		<dc:creator>Tom Cook</dc:creator>
		
		<category><![CDATA[Tom Cook, CEO]]></category>

		<category><![CDATA[3PAR]]></category>

		<category><![CDATA[Albireo]]></category>

		<category><![CDATA[BlueArc]]></category>

		<category><![CDATA[dedupe]]></category>

		<category><![CDATA[deduplication]]></category>

		<category><![CDATA[NetApp]]></category>

		<category><![CDATA[VMware]]></category>

		<guid isPermaLink="false">http://blog.permabit.com/?p=983</guid>
		<description><![CDATA[We are at VMworld this week and it should be interesting as VMware continues to press their aggressive growth agenda. We are here because our OEM data optimization solution, Albireo dedupes VMDK and VDI environments like nobodies&#8217; business (up to 100X) and this is a great week to talk with OEM customers and vendors who [...]]]></description>
			<content:encoded><![CDATA[<p>We are at <a href="http://www.vmworld.com/index.jspa" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.vmworld.com');">VMworld</a> this week and it should be interesting as VMware continues to press their aggressive growth agenda. We are here because our OEM data optimization solution, <a href="http://www.permabit.com/albireo/albireo-overview.asp" >Albireo</a> <a href="http://www.permabit.com/albireo/solutions.asp" >dedupes VMDK</a> and VDI environments like nobodies&#8217; business (up to 100X) and this is a great week to talk with OEM customers and vendors who are serious about storage efficiency.</p>
<p>We have two announcements at the show. The first is around Permabit extending our <a href="http://www.permabit.com/pressreleases/permabit-joins-vmware-tech-pntr-prog.asp" >VMware Alliance</a> to the Select level.  That means we are partnering more closely with VMware and their ecosystem. We are the only OEM data optimization partner of VMware.  Our second announcement is of our new <a href="http://www.permabit.com/pressreleases/permabit-bluearc-ann-oem-relation.asp" >OEM relationship with BlueArc</a>. Interestingly, the VMworld theme of every storage vendor, save two, is about &#8216;interoperability&#8217;. While good, interoperability or integrating with the VM management console isn&#8217;t that exciting to me. The real value is in optimizing the storage of VM. That brings me back to my main point - only two storage companies are talking about optimizing VMware environments - <a href="http://www.netapp.com/us/solutions/infrastructure/virtualization/server/server-vmware.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.netapp.com');">NetApp</a> and our new OEM partner <a href="http://www.bluearc.com/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.bluearc.com');">BlueArc</a> and that sets them apart from the rest of the storage crowd.</p>
<p><span id="more-983"></span>A couple of weeks ago the subject of how important data optimization is to NetApp&#8217;s growth came up during a discussion with a prominent analyst. He attributed NetApp&#8217;s growth to two factors: adopting primary data optimization and hitching their wagon to VMware. In effect he said, by implementing a dedupe solution and becoming the preferred storage for a software vendor exhibiting 40% growth (VMware), NetApp was achieving fantastic growth.</p>
<p>That made me think, why isn&#8217;t NetApp being as highly valued as <a href="http://www.3par.com/index.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.3par.com');">3PAR</a> by the market? I mean they are growing faster and have a more complete product offering.  So, why aren&#8217;t they getting as high a multiple? Hmm&#8230;could it be because the market doesn&#8217;t believe NetApp&#8217;s VMware driven growth will continue, that it is vulnerable and will come under strong competition? I can confirm VMDK/VDI optimization is a strong driver for our OEM customers, so I do see optimization rather than interoperability becoming table stakes at the VMware game.</p>
<p>We welcome BlueArc as a new OEM partner. Their high performance NAS offering will soon be even higher value and a perfect solution for VMware environments. My prediction is BlueArc and their partners will nick away at NetApp&#8217;s market leadership by offering the highest performing primary dedupe solution, Albireo and gain share in the high growth VMware market. Good luck to Gus and the BlueArc team.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.permabit.com/index.php/2010/08/interoperability-is-ok-but-optimization-is-optimal/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Latency, Latency, Latency&#8230;</title>
		<link>http://blog.permabit.com/index.php/2010/08/latency-latency-latency/</link>
		<comments>http://blog.permabit.com/index.php/2010/08/latency-latency-latency/#comments</comments>
		<pubDate>Wed, 25 Aug 2010 19:33:04 +0000</pubDate>
		<dc:creator>Wayne Salpietro</dc:creator>
		
		<category><![CDATA[Wayne Salpietro, Director PMM]]></category>

		<category><![CDATA[data optimization]]></category>

		<category><![CDATA[dedupe]]></category>

		<category><![CDATA[deduplication]]></category>

		<category><![CDATA[enterprise storage]]></category>

		<category><![CDATA[latency]]></category>

		<category><![CDATA[primary dedupe]]></category>

		<category><![CDATA[primary storage]]></category>

		<category><![CDATA[primary storage deduplication]]></category>

		<guid isPermaLink="false">http://blog.permabit.com/?p=969</guid>
		<description><![CDATA[ 
A few comments have been made regarding latency, most recently in a Search Storage piece from Dave Raffo, when comparisons were being made between compression and deduplication applied to primary storage. The concern expressed is that deduplication may be inflicting latency in the write or read process for primary storage. That may have been [...]]]></description>
			<content:encoded><![CDATA[<p align="center"><strong> </strong></p>
<p>A few comments have been made regarding latency, most recently in a <a href="http://searchstorage.techtarget.com/news/article/0,289142,sid5_gci1519051,00.html?track=sy60" onclick="javascript:pageTracker._trackPageview('/outbound/article/searchstorage.techtarget.com');">Search Storage piece from Dave Raffo</a>, when comparisons were being made between compression and deduplication applied to primary storage. The concern expressed is that deduplication may be inflicting latency in the write or read process for primary storage. That may have been an astute observation a year ago but today its simply just not a valid observation!<span id="more-969"></span></p>
<p>At issue is the performance of primary storage which needs to be as efficient as possible since businesses are run on primary storage that houses transactional data for core applications such as finance, manufacturing, CRM, sales operations, and supply chain to mention just a few. For these applications any significant delay in data access is simply not tolerable because businesses run on them! With today&#8217;s high performance data optimization (deduplication) tools such as Permabit&#8217;s Albireo there is no latency on data access because Albireo is NOT IN THE READ PATH!  And for that matter Albireo is not in the write path for all but in inline implementations.</p>
<p>In a recent blog post <a href="../../../../../index.php/2010/08/not-in-the-read-path-or-performance-matters/">NOT in the READ Path</a> I go into some of the details <em>&#8216;As we designed and developed Albireo, we specifically addressed the most critical of all performance criteria - data read. Why is it significant? Because data is usually written once, but can be read hundreds, thousands or millions of times clearly affecting the performance of the storage.  As a result, <strong>Albireo is NOT in the read path &#8212; so there is no performance impact on read and no latency! </strong></em></p>
<p><em>In the write process Albireo&#8217;s flexibility enables inline, parallel and post-process approaches (or any combination of the three) so the storage vendor can choose which approach they prefer. Let&#8217;s look at each of these:</em></p>
<p><em> </em></p>
<p>* <em>Of course in an inline approach, there is <strong>a very small write latency</strong> as Albireo does a hash key create and look-up. When we say very small, we&#8217;re talking a few microseconds. Can a user detect a few microsecond delay on write?  I think not!</em></p>
<p>* <em>In a <strong>parallel approach. there is NO performance impact on write</strong> at all because the vendor continues to write the data as they normally do.  Albireo provides duplicate advice to the storage and the vendor can apply it when cycles permit. </em></p>
<p>* <em>In a <strong>post-process deduplication approach, there is NO performance write impact</strong> because a write is done as the vendor normally would and a post-process deduplication can be initiated as cycles are available.&#8217; </em></p>
<p><strong> </strong></p>
<p>Albireo is a new approach to data deduplication that is embedded into the storage stack using patented indexing, memory utilization and hash key technologies enabling scale out and performance that until Albireo has not been seen in other deduplication implementations.  Previous iterations of dedupe, which have primarily been used for backup, had their idiosyncrasies such as performance, latency, scale-out and memory usage to name but a few. These do not apply in Albireo&#8217;s implementation because we have solved these issues as we developed Albireo.</p>
<p>In summary, using Albireo High Performance Data Optimization (deduplication) in primary storage does not introduce latency!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.permabit.com/index.php/2010/08/latency-latency-latency/feed/</wfw:commentRss>
		</item>
		<item>
		<title>The Case for VMware and Deduplication</title>
		<link>http://blog.permabit.com/index.php/2010/08/the-case-for-vmware-and-deduplication/</link>
		<comments>http://blog.permabit.com/index.php/2010/08/the-case-for-vmware-and-deduplication/#comments</comments>
		<pubDate>Wed, 25 Aug 2010 16:03:06 +0000</pubDate>
		<dc:creator>Louis Imershein</dc:creator>
		
		<category><![CDATA[Louis Imershein, Director PM]]></category>

		<category><![CDATA[Albireo]]></category>

		<category><![CDATA[dedupe]]></category>

		<category><![CDATA[deduplication]]></category>

		<category><![CDATA[enterprise storage]]></category>

		<category><![CDATA[primary storage]]></category>

		<category><![CDATA[primary storage deduplication]]></category>

		<category><![CDATA[virtualization]]></category>

		<category><![CDATA[VMware]]></category>

		<guid isPermaLink="false">http://blog.permabit.com/?p=961</guid>
		<description><![CDATA[We&#8217;ve all watched over the past several years as VMware virtualization has revolutionized IT, instigating the most significant consolidation of computer hardware assets in history. What you may not know is that VMware deployments offer one of the most attractive use cases for data optimization in a primary storage environment. A great example of this [...]]]></description>
			<content:encoded><![CDATA[<p>We&#8217;ve all watched over the past several years as <a href="http://www.vmware.com" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.vmware.com');">VMware</a> virtualization has revolutionized IT, instigating the most significant consolidation of computer hardware assets in history. What you may not know is that VMware deployments offer one of the most attractive use cases for data optimization in a primary storage environment. A great example of this comes from our friends over at <a href="http://www.netapp.com" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.netapp.com');">NetApp</a>®. Shortly after releasing their deduplication feature, NetApp&#8217;s customers began to report tremendous success with deduplicating VMware virtual machines (VMs). Their users were seeing storage savings of 50% or more with little performance impact and some were seeing savings approaching 90%. Let&#8217;s take a look at how primary storage deduplication works in VMware environments with Permabit&#8217;s <a href="http://www.permabit.com/albireo/albireo-overview.asp" >Albireo</a>™ OEM High Performance Data Optimization software.<span id="more-961"></span></p>
<p>Deduplication systems work by identifying and eliminating duplicate chunks of data. With Albireo, deduplication gets implemented by the OEM at the shared-storage, logical volume or filesystem level - the larger the shared pool of data on the back-end, the greater the potential for savings from redundant data. <a href="http://www.permabit.com/albireo/deployment.asp" >Albireo&#8217;s flexible deployment options </a>enable deduplication to occur inline, parallel or as a post-process event to address varying performance requirements.  We&#8217;re working with some OEMs who are implementing all three options on the same systems.  Regardless of where or when deduplication is implemented, the process is the same; duplicate chunks are eliminated and data pointers are modified to share a single data chunk on disk.   Albireo&#8217;s job is to identify those duplicates as quickly as possible, while utilizing as few system resources as possible!</p>
<p>To help simplify the management of consolidated environments, VMware server virtualization allows administrators to create &#8216;template operating environments&#8217;, each with a standardized operating system and application environment. These templates are then &#8216;cloned&#8217; into separate VMware images and installed as &#8216;guests&#8217; on a physical server. The result is savings from simplified configuration management. Today, with modern processors from <a href="http://www.amd.com" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.amd.com');">AMD </a>and <a href="http://www.intel.com" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.intel.com');">Intel</a>, I&#8217;m seeing VMware users running modern server environments with 12 VM guests per physical server. I have heard of desktop virtualization environments with over 60 &#8216;guests&#8217; per server. While VMware substantially reduced server costs, it has done nothing to help storage administrators who are left with the same number of operating system images on what is now a complex mixture of shared-storage resources.</p>
<p>It&#8217;s important to realize that while VMware provides a valuable cost benefit from consolidating IT servers, it offers no means for consolidating the storage used by VMware &#8216;<a href="http://pubs.vmware.com/vi35/admin/BSA_Templates.17.1.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/pubs.vmware.com');">clones</a>&#8216;.  That&#8217;s where deduplication comes in. You see it turns out that each of these &#8216;cloned&#8217; VMware images starts off by taking up as much space as the template from which it was created. The set of data chunks making up a &#8216;<a href="http://pubs.vmware.com/vi35/admin/BSA_Templates.17.1.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/pubs.vmware.com');">template</a>&#8216; and each of its &#8216;clones&#8217; are nearly identical. When Albireo see&#8217;s these duplicate &#8216;chunks&#8217; it identifies them and allows the storage system to replace them with pointers to the original data. Its easy to see why in a 12 VM guest environment, there&#8217;s nearly a 12:1 savings even after accounting for personalization of each guest. For a 60 VM Desktop environment, there&#8217;s a potential for almost 60:1 savings!</p>
<p>Next week I&#8217;ll talk a little about how Albireo can actually help VMware administrators better meet their storage performance requirements.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.permabit.com/index.php/2010/08/the-case-for-vmware-and-deduplication/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Data Safety - It&#8217;s the ONLY issue</title>
		<link>http://blog.permabit.com/index.php/2010/08/data-safety-its-the-only-issue/</link>
		<comments>http://blog.permabit.com/index.php/2010/08/data-safety-its-the-only-issue/#comments</comments>
		<pubDate>Tue, 24 Aug 2010 18:33:25 +0000</pubDate>
		<dc:creator>Wayne Salpietro</dc:creator>
		
		<category><![CDATA[Wayne Salpietro, Director PMM]]></category>

		<category><![CDATA[data optimization]]></category>

		<category><![CDATA[data protection]]></category>

		<category><![CDATA[data safety]]></category>

		<category><![CDATA[deduplication]]></category>

		<category><![CDATA[enterprise storage]]></category>

		<category><![CDATA[primary storage]]></category>

		<category><![CDATA[primary storage deduplication]]></category>

		<guid isPermaLink="false">http://blog.permabit.com/?p=959</guid>
		<description><![CDATA[In my last two posts, I have emphasized key Albireo features. NOT in the Read Path and Flexibility in deployment. These really do set Albireo apart from the early days of deduplication and other similar approaches being discussed today. However, I believe the most important issue with data storage is and always will be - [...]]]></description>
			<content:encoded><![CDATA[<p>In my last two posts, I have emphasized key Albireo features. <a href="http://blog.permabit.com/index.php/2010/08/not-in-the-read-path-or-performance-matters/" >NOT in the Read Path</a> and <a href="http://blog.permabit.com/index.php/2010/08/albireo-flexibility/" >Flexibility</a> in deployment. These really do set Albireo apart from the early days of deduplication and other similar approaches being discussed today. However, I believe the most important issue with data storage is and always will be - how safe is my data? After all your business depends on your data and you need to be confident data is available and can be accessed even if some of your storage technology fails or becomes obsolete. <span id="more-959"></span><br />
As storage technologies evolve, they take several steps in their evolution from stand alone point solutions to appliances and eventually to integrated or embedded offerings and sometimes onto a chip.  With data optimization we are moving into the integrated/embedded stage of deduplication with Albireo.  There are some very significant differences compared to previous stages as this step occurs and the most prominent is data safety. Why you might ask?  In the early stages, appliances or point software solutions, data is changed or modified as it is processed/stored by the application and/or appliance. In order to read that data, the software or appliance needs to be in the read path to interpret or reconstitute the data.  Most of the time this is easily done, but at a cost. There is a processing overhead associated with this activity.  There is a much larger concern however &#8212; the <em>risk</em> associated with it. What happens if the appliance of software fails or becomes obsolete due to acquisition or as a result of business disruption? Can you read your data? Is there a fall back plan? Can you access the data and run some sort of interpreter to get your data back to readable form so you can continue to run your business?</p>
<p>Albireo addresses the data safety issue head on. It never changes the data! Unlike a point solution or appliance, Albireo also doesn&#8217;t change the data or impact the read/write process (see <a href="http://blog.permabit.com/index.php/2010/08/not-in-the-read-path-or-performance-matters/" >NOT in the Read Path</a>) because it is an advisory service to the storage stack. Simply said, Albireo analyzes the data, determines if it is duplicate and advises the storage that it is or is not a duplicate. If it is not a duplicate, the storage system does nothing to the data. If it is a duplicate, the storage system can create a pointer to the original data and remove the duplicate releasing storage space and reducing associated costs. The effect is that there will be less storage consumed, less power/cooling needed, less data center footprint and less storage to be bought to meet anticipated data growth.  These impacts drop to your CAPEX and OPEX providing budget relief!</p>
<p>How does this impact data safety? Simple, the data is written as it always would be by the storage system in its native. It is never changed or modified nor does it need an interpreter to reconstitute it.  There is no third party dependency! So if the data optimization functionality is disabled, removed, or fails data can still be read. Your data is as safe as it always is and is protected by your storage system!</p>
<p>Our partners will be deploying it in storage systems later this year and next year. If your storage vendor is not discussion this technology advance with you ask them about it!</p>
<p>Albireo is a technological step forward. It will impact your business in a very positive way and it will be easier, more efficient, eliminate risk and be less costly as it optimizes the data storage your business runs upon. Isn&#8217;t that what progress is all about!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.permabit.com/index.php/2010/08/data-safety-its-the-only-issue/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Albireo Flexibility</title>
		<link>http://blog.permabit.com/index.php/2010/08/albireo-flexibility/</link>
		<comments>http://blog.permabit.com/index.php/2010/08/albireo-flexibility/#comments</comments>
		<pubDate>Thu, 19 Aug 2010 15:08:54 +0000</pubDate>
		<dc:creator>Wayne Salpietro</dc:creator>
		
		<category><![CDATA[Wayne Salpietro, Director PMM]]></category>

		<category><![CDATA[Albireo]]></category>

		<category><![CDATA[data optimization]]></category>

		<category><![CDATA[dedupe]]></category>

		<category><![CDATA[deduplication]]></category>

		<category><![CDATA[deployment]]></category>

		<category><![CDATA[flexibility]]></category>

		<category><![CDATA[primary storage]]></category>

		<category><![CDATA[primary storage deduplication]]></category>

		<guid isPermaLink="false">http://blog.permabit.com/?p=955</guid>
		<description><![CDATA[As we talked to potential OEMs for our Albireo Data Optimization offering several key requirements kept emerging. One was &#8216;don&#8217;t impact performance&#8217;, followed by &#8216;don&#8217;t impact my existing feature set (because I&#8217;ve invested $$$ in it)&#8217; and finally &#8216;please make your deduplication easy and flexible to deploy.&#8217; As we developed Albireo, we kept these in [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: left;">As we talked to potential OEMs for our Albireo Data Optimization offering several key requirements kept emerging. One was &#8216;don&#8217;t impact performance&#8217;, followed by &#8216;don&#8217;t impact my existing feature set (because I&#8217;ve invested $$$ in it)&#8217; and finally &#8216;please make your deduplication easy and flexible to deploy.&#8217; As we developed Albireo, we kept these in mind and have succeeded in addressing each!  <span id="more-955"></span></p>
<p>&#8216;Don&#8217;t impact performance&#8217; I addressed in my blog last week: <a href="http://blog.permabit.com/index.php/2010/08/not-in-the-read-path-or-performance-matters/" >&#8216;Not in The Read Path&#8217;</a>. Simply said, we don&#8217;t impact performance and provide a simple and elegant method of implementing deduplication. As a result of our approach, we developed a dedupe advisory approach that also enables the OEM to utilize all of their existing features as well!</p>
<p>When it comes to flexibility, we really took a step forward. We knew the concerns about deduplication technology revolved around 3 key areas:</p>
<p>* Hash key development/indexing and performance impact</p>
<p>* Resource utilization</p>
<p>* Deployment flexibility.</p>
<p>Hash keys are the lifeblood of deduplication in that they are the unique identifier of a data chunk. Once the key is developed for a chunk of data the next question is &#8216;have I seen this data before?&#8217; This is where it gets difficult because doing this look up is what causes most dedupe systems to come to a grinding halt. Big indexes and large keys (yes we use SHA 256 so the key is large) create performance slow downs and in some lesser systems, scalability limits.  We skinned that cat with <a href="http://permabit.com/albireo/architecture.asp" >patented indexing</a> that can return an index lookup in a few micro seconds! Hardly impact the performance at all! And scale out is also managed by patented memory based indexing that is 99.5% memory resident! No slow down there either!  Hardly ever see a disk fetch!   Once we do this we return advice to the OEM storage stack that the data chunk is a duplicate. The OEM then creates a pointer for the duplicate.</p>
<p>Resource utilization or how much of the processing power is consumed is a significant issue for storage vendors.  For example, newer systems have fast quad core processors (sometimes 2!)  while older or less high performance systems may have dual core processors.  In any case a flexible dedupe engine should enable the OEM to define the number of processors they used for Albireo depending on their overall system performance criteria. We designed that flexibility into Albireo to ensure our deduplication did not consume resources that would impact our partners performance.</p>
<p>Flexibility as you can see from the resource utilization comments is already there but we took an additional step by enabling our OEM partners to deploy Albireo in <a href="http://permabit.com/albireo/deployment.asp" >inline, post-process or parallel (hybrid) methods</a>.  For example, if the partner performance tests enable an inline deduplication within the OEMs performance characteristics then they can implement Albireo inline!   If they want to balance their performance and deduplication efficiency then parallel mode may be the best for that partner. And finally, if high performance storage were the dominant criteria, then post-process mode for Albireo would work for that partner! We also give the option for the OEM to leverage all 3 so depending on use cases, they can implement the appropriate method (all within one system). Their choice!</p>
<p>All this is done with <a href="http://permabit.com/albireo/deployment.asp" >an SDK</a> that can be integrated with 6-8 API calls. Some of our partners have integrated in as little as a few days. Albireo is an OEM partners tool that they can deploy based on their performance, resource and efficiency design criteria.  We made it that way!</p>
<p>In my next posting I&#8217;ll discuss data protection.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.permabit.com/index.php/2010/08/albireo-flexibility/feed/</wfw:commentRss>
		</item>
		<item>
		<title>NOT in the Read Path or &#8216;Performance Matters&#8217;</title>
		<link>http://blog.permabit.com/index.php/2010/08/not-in-the-read-path-or-performance-matters/</link>
		<comments>http://blog.permabit.com/index.php/2010/08/not-in-the-read-path-or-performance-matters/#comments</comments>
		<pubDate>Thu, 12 Aug 2010 15:35:06 +0000</pubDate>
		<dc:creator>Wayne Salpietro</dc:creator>
		
		<category><![CDATA[Wayne Salpietro, Director PMM]]></category>

		<category><![CDATA[Albireo]]></category>

		<category><![CDATA[capex]]></category>

		<category><![CDATA[data optimization]]></category>

		<category><![CDATA[dedupe]]></category>

		<category><![CDATA[deduplication]]></category>

		<category><![CDATA[enterprise storage]]></category>

		<category><![CDATA[opex]]></category>

		<category><![CDATA[primary storage]]></category>

		<category><![CDATA[primary storage deduplication]]></category>

		<guid isPermaLink="false">http://blog.permabit.com/?p=950</guid>
		<description><![CDATA[In Mike&#8217;s Blog a couple of weeks ago, &#8216;Get Out of My Way&#8217;, he accurately portrayed our Albireo offering as &#8216;not in the read path&#8217;.  In primary storage implementations, it&#8217;s critical to ensure the most efficient access to data. &#8216;Out of the read path&#8217; has been one of our key messages that bear repeating because [...]]]></description>
			<content:encoded><![CDATA[<p>In Mike&#8217;s Blog a couple of weeks ago, <a href="../../../../../index.php/2010/07/get-out-of-my-way/">&#8216;Get Out of My Way&#8217;</a>, he accurately portrayed our Albireo offering as &#8216;not in the read path&#8217;.  In primary storage implementations, it&#8217;s critical to ensure the most efficient access to data. &#8216;Out of the read path&#8217; has been one of our key messages that bear repeating because it&#8217;s so significant. Albireo is embedded in the storage vendors stack so we have been extremely careful not to impact their operations. As such, we have developed Albireo around three key principles: performance, simplicity of deployment and data safety. <span id="more-950"></span>In this post, I&#8217;ll look at the most prevalent concern of our partners &#8212; performance.</p>
<p>The most significant concern raised as we discussed Albireo with our partners has been the potential impact to performance. In many ways it&#8217;s their stock and trade and they protect it as any business should &#8216;with their lives!&#8217; The thought of embedding deduplication technology, even though there is a clear customer benefit in cost, space and power/cooling reduction, is orthogonal to their performance mantra particularly in primary storage.</p>
<p>As we designed and developed Albireo, we specifically addressed the most critical of all performance criteria - data read. Why is it significant? Because data is usually written once, but can be read hundreds, thousands or millions of times clearly affecting the performance of the storage.  As a result, <strong>Albireo is NOT in the read path &#8212; so there is no performance impact on read!</strong></p>
<p>In the write process, we also were careful to provide our partners choices that enable them to determine whether to impact their performance or not.  Albireo&#8217;s flexibility enables inline, parallel and post-process approaches (or any combination of the three) so the storage vendor can choose which approach they prefer. Let&#8217;s look at each of these:</p>
<p>* Of course in an inline approach. there is <strong>a very small write latency</strong> as Albireo does a hash key create and lookup. When we say very small, we&#8217;re talking mere <strong>microseconds </strong>across the entire data stream. Can a user detect a microsecond delay?  I think not!</p>
<p>* In a <strong>parallel approach. there is NO performance impact on write</strong> at all because the vendor continues to write the data as they normally do.  Albireo provides duplicate advice to the storage and the vendor can apply it when cycles permit.</p>
<p>* In a <strong>post-process deduplication approach. there is NO performance write impact</strong> because a write is done as the vendor normally would and a post-process deduplication can be initiated as cycles are available.</p>
<p><strong> </strong></p>
<p><strong>In summary, Albireo does NOT impact performance on read! There is also NO impact to write performance in parallel or post-process modes.  So, the vendor concerns about performance are addressed! </strong></p>
<p><strong> </strong></p>
<p>In my next post I&#8217;ll address implementation simplicity.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.permabit.com/index.php/2010/08/not-in-the-read-path-or-performance-matters/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Why &#8216;The Summer of 2010&#8242; changed the storage landscape&#8230;</title>
		<link>http://blog.permabit.com/index.php/2010/08/why-the-summer-of-2010-changed-the-storage-landscape/</link>
		<comments>http://blog.permabit.com/index.php/2010/08/why-the-summer-of-2010-changed-the-storage-landscape/#comments</comments>
		<pubDate>Thu, 05 Aug 2010 14:50:19 +0000</pubDate>
		<dc:creator>Wayne Salpietro</dc:creator>
		
		<category><![CDATA[Wayne Salpietro, Director PMM]]></category>

		<category><![CDATA[capex]]></category>

		<category><![CDATA[compression]]></category>

		<category><![CDATA[data optimization]]></category>

		<category><![CDATA[dedupe]]></category>

		<category><![CDATA[deduplication]]></category>

		<category><![CDATA[enterprise storage]]></category>

		<category><![CDATA[opex]]></category>

		<category><![CDATA[primary dedupe]]></category>

		<category><![CDATA[primary storage]]></category>

		<category><![CDATA[primary storage deduplication]]></category>

		<category><![CDATA[unified storage]]></category>

		<guid isPermaLink="false">http://blog.permabit.com/?p=946</guid>
		<description><![CDATA[ 
 
 
Usually summer is a time for reflection and sun tanning at the beach. From a business perspective things tend to slow down a bit while each of us takes a second breath and spend some quality time with our families away from the business side of our lives. There is usually not [...]]]></description>
			<content:encoded><![CDATA[<p align="center"><strong> </strong></p>
<p align="center"><strong> </strong></p>
<p align="center"><strong> </strong></p>
<p>Usually summer is a time for reflection and sun tanning at the beach. From a business perspective things tend to slow down a bit while each of us takes a second breath and spend some quality time with our families away from the business side of our lives. There is usually not much industry news few &#8216;big deals&#8217; occur that were not already in play. Technology shift announcements are typically saved for September when everyone is back, so there is as much traction as possible. Not the case for the summer of 2010!<span id="more-946"></span></p>
<p>It may have all started in early June when Permabit announced Albireo. Albireo is a simple and elegant data optimization technology that makes it possible to embed data deduplication into primary storage arrays with a few API calls to the existing storage firmware/operating environment.  The press, analysts, OEMs and bloggers,  were very supportive and effusive: <a href="http://www.infostor.com/index/blogs_new/Jeff-Boles-Blog.blogs.infostor.jeff-boles-blog.post987_5070808334471504944.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.infostor.com');">&#8216;the tectonic plates of storage have shifted&#8217;</a> or <a href="http://www.theregister.co.uk/2010/06/07/permabit_albireo/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.theregister.co.uk');">&#8216;Killer dedupe&#8217;</a> or <a href="http://www.permabit.com/pressreleases/permabit-anns-albireo-hi-perf-sw.asp" >&#8216;This stuff is so far ahead in its capabilities and performance I can&#8217;t see why you (OEMS) would want to do it yourself&#8217;</a> to mention a few.  And yes, there were competitors snipes in the blogosphere - few were given any credence because of the support from press, analysts and OEMs. This started a domino effect that continues today.</p>
<p>A few weeks later, <a href="http://h18006.www1.hp.com/storage/new/index.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/h18006.www1.hp.com');">HP announced StoreOnce</a> at their conference and talked about end-to-end deduplication (data optimization) setting a great vision! They went on to announce and describe their implementation of HP Labs deduplication technology (StoreOnce) via their D2D backup appliance. The rest of the details will evolve as they deliver and fulfill the end-to-end vision over the next 18-24 months.</p>
<p>Not to be upstaged by either a startup from Cambridge or the second largest storage vendor on the planet, EMC said &#8216;We got that&#8217; as they announced their <a href="http://www.theregister.co.uk/2010/06/25/emc_viper/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.theregister.co.uk');">&#8216;Viper Team&#8217;</a> was working on data optimization (deduplication &amp; compression) technologies that would concatenate the IP within EMC and offer an ability to apply end-to-end data optimization on their entire storage offering - another vision of the end state.  Details to be announced later!</p>
<p>Rumors were flying by then involving IBM and Dell acquisitions. <a href="http://www.informationweek.com/news/hardware/data_centers/showArticle.jhtml?articleID=226000101" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.informationweek.com');">. Dell would eventually buy Ocarina</a> for their IP in compression and deduplication via Kleiner-Perkins with the implied intent to deploy data optimization across their storage offering. They also want to become a storage supplier beyond their own offerings by hoping to engage with the OEM partners that Ocarina had partnered with.</p>
<p><a href="http://www.dataprotectionperspectives.com/2010/07/remove-another-chair-ibm-snatches-storwize" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.dataprotectionperspectives.com');">IBM would announce their acquisition of Storwize</a> whose compression IP could eventually be embedded into IBM storage. This would be even more powerful if combined with deduplication (isn&#8217;t that data optimization too!) and reduce storage consumption for their customers.</p>
<p>All that in June and July!  What will August bring?</p>
<p>For end customers &#8216;this changes everything!&#8217;  Why you might ask?  Data stored in primary data stores will be compressed and/or deduplicated saving 50-75% or more of the storage space required. As a result, effective costs of storage will be substantially reduced, data center floor space requirements will drop proportionally, cooling and power will do the same and resources to manage ever expanding data stores will become more efficient. In parallel, storage drives will continue to become denser and 2, 4 and 6TB drives will be seen in storage array deployments during 2010 and 2011. Storage vendors will also begin deployment of more efficient unified storage technologies that incorporate embedded deduplication and/or compression, while legacy storage may be enhanced to incorporate embedded deduplication and/or compression. As end-to-end data optimization (extending the impact of dedupe/compression to secondary, archive and backup storage) is deployed, the cost reductions mention above regarding primary storage will multiply and storage expenses will drop significantly as a proportion of IT budgets! Bottom line - your CAPEX and OPEX will be significantly reduced as a result. That&#8217;s why the events of the summer of 2010 changed everything!</p>
<p>If your storage supplier is not telling you they have or will have data optimization in their primary storage offerings, if not end-to-end, I&#8217;d ask why not and reconsider your storage vendor choices!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.permabit.com/index.php/2010/08/why-the-summer-of-2010-changed-the-storage-landscape/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Summer Heat, Baseball and Data Optimization</title>
		<link>http://blog.permabit.com/index.php/2010/07/summer-heat-baseball-and-data-optimization/</link>
		<comments>http://blog.permabit.com/index.php/2010/07/summer-heat-baseball-and-data-optimization/#comments</comments>
		<pubDate>Fri, 30 Jul 2010 17:01:24 +0000</pubDate>
		<dc:creator>Tom Cook</dc:creator>
		
		<category><![CDATA[Tom Cook, CEO]]></category>

		<category><![CDATA[compression]]></category>

		<category><![CDATA[dedupe]]></category>

		<category><![CDATA[enterprise archive]]></category>

		<category><![CDATA[primary storage]]></category>

		<category><![CDATA[primary storage deduplication]]></category>

		<guid isPermaLink="false">http://blog.permabit.com/?p=941</guid>
		<description><![CDATA[Whew&#8230;things are certainly heating up. Last week, Dell agreed to acquire Ocarina Networks for an undisclosed amount. As I mentioned to Chris Mellor at the Register, this was a good indication of how primary storage is, &#8220;moving from &#8216;bump in the wire&#8217; compression/deduplication solutions to integrated capacity optimization&#8221;. 
And, yesterday, IBM announced the acquisition of [...]]]></description>
			<content:encoded><![CDATA[<p>Whew&#8230;things are certainly heating up. Last week, Dell agreed to acquire Ocarina Networks for an undisclosed amount. As I mentioned to Chris Mellor at <a href="http://www.theregister.co.uk/2010/07/20/dell_buys_ocarina/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.theregister.co.uk');">the Register</a>, this was a good indication of how primary storage is, &#8220;<strong>moving from &#8216;bump in the wire&#8217; compression/deduplication solutions to integrated capacity optimization&#8221;. </strong></p>
<p>And, yesterday, IBM announced the acquisition of Storwize. This is another great proof point for primary data optimization going main stream. Here&#8217;s how we view these developments:<span id="more-941"></span></p>
<ul class="unIndentedList">
<li> Primary data optimization is emerging and will gain momentum over the coming months as storage companies seek to protect margins and improve their competitive positions.</li>
<li> Optimization technologies that are out of the data path and preserve data safety and storage performance will win in the market.</li>
</ul>
<p>From our perspective, it&#8217;s still early in the game. Both Storwize and Ocarina sold data path appliances - and that is a very challenging (some would say, impossible) business model. Both are better as &#8220;owned&#8221; assets. Here is our take on the deals.</p>
<p><strong>Dell/Ocarina</strong></p>
<p>We are hearing from many who are questioning this transaction.</p>
<ul class="unIndentedList">
<li> Ocarina&#8217;s attempt to move from vertical (Media/Entertainment &amp; Oil &amp; Gas) to horizontal enterprise application will be a significant challenge. Moving from secondary and application niches to primary and enterprise - is like going from Summer League to the Big Show (baseball metaphor).</li>
<li> Ocarina signed some nice names for &#8220;meet in the market&#8221; appliance deals, but received limited to no traction since their signing.</li>
<li> Dell acquired an appliance technology (and maybe some beta code)? Is this a technology fit for Dell?
<ul>
<li> Dell has a limited storage portfolio today - they only own two SKUs of any significance (Windows Storage Server and EqualLogic).</li>
<li> Ocarina is a doubtful bolt on to CLARiiON which is in the middle of the Dell storage portfolio! How does Dell go end to end? Big stretch.</li>
<li> In this case end-to-end data optimization is at best point-to-point if the Ocarina embedded technology is valid!</li>
<li> EqualLogic begs the question of &#8220;what is the technology value of Ocarina content aware compression in block environments?&#8221;</li>
<li> Dell has had limited integration track record. Will Ocarina be different?</li>
</ul>
</li>
<li> Dell has some heavy lifting to do. End to end in anything is hard work and not for the faint of heart. It takes a full product portfolio and a robust technology solution to make it happen.</li>
</ul>
<p><strong>IBM/Storwize</strong></p>
<p>From our perspective, IBM&#8217;s acquisition of Storwize makes sense.</p>
<ul class="unIndentedList">
<li> Compression is a great first step toward data optimization for IBM.</li>
<li> If combined with a unified deduplication capability, IBM could have a best of breed data optimization solution giving IBM a lead on its competitors.</li>
<li> Storwize compression today is an &#8220;in the data path&#8221; operation. From our perspective, that reason alone makes sense for IBM to own it - lock, stock and barrel - so it can be embedded into their storage architectures to minimize its impact.</li>
<li> From our take, IBM hits a double (another baseball metaphor) and the heat will continue to quickly rise around the value of data optimization. How they move this forward to potentially score the winning run (yet another&#8230;) to compete with HP, EMC, HDS and now Dell, will play out over the coming months.</li>
</ul>
<p><strong>Permabit</strong></p>
<p>There&#8217;s been much speculation around who Permabit is working with and what happens next.  We began introducing OEMs to Albireo primary data optimization about 9 months ago. We&#8217;ll confirm, we are working to enable dozens of storage products with deduplication that will enter the market in 2010-2012 and leave the speculation to <a href="http://searchstorage.techtarget.com/news/article/0,289142,sid5_gci1517246,00.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/searchstorage.techtarget.com');">Dave Raffo at SearchStorage</a> and others as to who our partners are.  Again, it is early, but we are on a mission to provide Albireo data optimization to storage vendors to leverage their R&amp;D and market reach and make their storage better, faster and more efficient.</p>
<p>We firmly believe for any company to make it past the early innings (final baseball metaphor) with an OEM data optimization solution, they need to offer the following:</p>
<ul class="unIndentedList">
<li> Solid, tested architecture - seasoned</li>
<li> No performance impact - capable of handling demanding primary workloads</li>
<li> No data safety compromise - out of read path</li>
<li> Scale - to PBs of data capacity</li>
<li> Best of class dedupe rates - for massive storage savings</li>
</ul>
<p>Albireo offers all of that and the fastest route to market for storage companies looking to add data optimization to their products. For those of you keeping score, what do you think?</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.permabit.com/index.php/2010/07/summer-heat-baseball-and-data-optimization/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
