Archive for the ‘emc’ Category

vSpecialists get the job done! (#VMworld EMC vSpecialist Rap Video!)

August 31st, 2010

Whoa, wait a minute, cut that track, Yo Dre.. What is this?!? A Rap Video by the vSpecialists?!?

Yea, seriously, this IS a Rap Video! Albeit sure you’re saying “Wow, isn’t that Circa Chicago Bears in the mid 1980’s with the superbowl shuffle?!” And I would go so far as to say.. The bears were rockstars at that time were they not?    This is simply a further reference to just how far these guys go the extra mile, both in their ability to get things actually done (as they say so in there) but a testament what each of these individuals do.   When they’re not busy doing their actual jobs, or studying or taking exams, or writing blog posts, or writing chapters for the next greatest Virtualization book you’ll be buying… or traveling all over the place, OR building out LABS and testing out things in every possible configuration and publishing the results..

Well… when all of that is said… and all of that is done.    They make rap videos! :)

I hope you enjoy it! The Production quality of this is absolutely AMAZING! I saw it on a big screen and I was floored at how well it was done (I always judge for production quality before I even look at content.. and the content is crazy good times ;))

Tags: , , , ,
Posted in VMworld, Virtualization, emc, vSphere, vmware | Comments (View Comments)

Data Longevity, VMware deduplication change over time, NetApp ASIS deterioration and EMC Guarantee

August 18th, 2010

Hey guys, the other day I was having a conversation with a friend of mine that went something like this.

How did this all start you might say?!? Well, contrary to popular belief, I am a STAUNCH NetApp FUD dispeller.  What that means is, if I hear something said about NetApp by a competitor, peer, partner or customer which I feel is incorrect or just sounds interesting; I task it upon myself to prove/disprove it because well frankly… People still hit me up with NetApp questions all the time :) (And I’d like to make sure I’m supplying them with the most accurate and reflective data! – yea that’s it, and it has nothing to do with how much of a geek I am.. :))

Well, in the defense of the video it didn’t go EXACTLY like that.   Here is a little background on how we got to where that video is today :)   I recently overheard someone say the following:

What I hear over and over is that dedupe rates when using VMware deteriorate over time

And my first response was “nuh uh!”, Well, maybe not my FIRST response.. but quickly followed by; “Let me try and get some foundational data”  because you know me… I like to blog about things and as a result collect way too much data to try to validate and understand and effectively say whatever I say accurately :)

The first thing I did was engage several former NetApp folks who are as agnostic and objective as I am to get their thoughts on the matter (we were on the same page!)Data collection time!  

For Data Collection… I talked to some good friends of mine regarding how their Dedupe savings have been over time because they were so excited when we first enabled it in the first place (And I was excited for them!)   This is where I learned some… frankly disturbing things (I did talk to numerous guys named Mike interestingly enough, and on the whole all of those who I talked with and their data they shared with me reflected similar findings)

Disturbing things learned!

Yea I’ve heard all the jibber jabber before usually touted as FUD that NetApp systems will deteriorate over time in general (whether it be Performance, whether it be Space Savings) etc etc. 

Well some of the disturbing things learned actually coming from the field on real systems protecting real production data was:

  • Space Savings are GREAT, and will be absolutely amazing in the beginning! 70-90% is common… in the beginning. (Call this the POC and the burn-in period)
  • As that data starts to ‘change’ ever so slightly as you would expect your data to change (not sit static and RO) you’ll see your savings start to decrease, as much as 45% over a year
  • This figure is not NetApp’s fault.  Virtual machines (mainly what we’re discussing here) are not designed to stay uniformly the same no matter what in accordance to 4k blocks, so the very fact that they change is absolutely normal so this loss isn’t a catastrophe, it’s a fact of the longevity of data.
  • Virtual Machine data which is optimal for deduplication typically amounts to 1-5% of the total storage in the datacenter.   In fact if we want to lie to ourselves or we have a specific use-case, we can pretend that it’s upwards of 10%, but not much more than that.  And this basically accounts for Operating System, Disk Image, blah blah blah – the normal type of data that you would dedupe in the first place.
    • I found that particularly disturbing because after reviewing the data from these numerous environments… I had the impression VMware data would account for much more!   I saw a 50TB SAN only have ~2TB of data residing in Data stores and of that only 23% of it was deduplicating (I was shocked!)
    • I was further shocked that when reviewing the data that over the course of a year on a 60TB SAN, this customer only found 12TB of data they could justify running the dedupe process against and of that they were seeing less than 3TB of ‘duplicate data’ coming in around 18% space savings over that 12TB.    The interesting bit is that the other 48TB of data just continued on un-affected by dedupe.   (Yes, I asked why don’t they try to dedupe it… and they did in the lab and, well it never made it into production)

    At this point, I was even more so concerned.   Concerned whether there was some truth to this whole NetApp starts really high in the beginning (Performance/IO way up there, certain datasets will have amazing dedupe ratios to start) etc. and then starts to drop off considerably over time, while the EMC equivalent system performs consistently the entire time.

    Warning! Warning Will Robinson!

    This is usually where klaxons and red lights would normally go off in my head.    If what my good friends (and customers) are telling me is accurate, it is that not only will my performance degrade just by merely using the system, but my space efficiency will deteriorate over time as well.    Sure we’ll get some deduplication, no doubt about that!  But the long term benefit isn’t any better than compression (as a friend of mine had commented on this whole ordeal)    With the many ways of trying to look at this and understand I discussed it with my friend Scott who had the following analogy and example to cite with this:

    The issue that I’ve seen is this:

    Since a VMDK is a container file, the nature of the data is a little different than a standard file like a word doc for example.

    Normally, if you take a standard windows C: – like on your laptop, every file is stored as 4K blocks.  However, unless the file is exactly divisible by 4K (which is rare), the last block has just a little bit of waste in it.  Doesn’t matter if this is a word doc, a PowerPoint, or a .dll in the \windows\system32 directory, they all have a little bit of waste at the end of that last block.

    When converted to a VMDK file, the files are all smashed together because inside the container file, we don’t have to keep that 4K boundary.  Kind of like sliding a bunch of books together on a book shelf eliminating the wasted space.  Now this is one of the cool things about VMware that makes the virtual disk more space efficient than a physical disk – so this is a good thing.

    So, when you have a VMDK and you clone it – let’s say create 100 copies and then do a block based dedupe – you’ll get a 99% dedupe rate across those virtual disks.  That’s great – initially.  Netapp tends to calculate this “savings” into their proposals and tell customers that require 10TB of storage, that they can just buy 5TB and dedupe and then they’ll have plenty of space.

    What happens is, that after buying ½ the storage they really needed the dedupe rate starts to break down. Here’s why:

    When you start running the VMs and adding things like service packs or patches for example – well that process doesn’t always add files to the end of the vmdk.  It often deletes files from the middle, beginning, end and then  replaces them with other files etc.  What happens then is that the bits shift a little to the left and the right – breaking the block boundaries. Imagine adding and removing books of different sizes from the shelf and making sure there’s no wasted space between them.

    If you did a file per file scan on the virtual disk (Say a windows C: drive), you might have exactly the same data within the vmdk, however since the blocks don’t line up, the block based dedupe which is fixed at 4K sees different data and therefore the dedupe rate breaks down.

    A sliding window technology (like what Avamar does ) would solve this problem, but today ASIS is fixed at 4K. 

    Thoughts?

    If you have particular thoughts about what Scott shared there, feel free to comment and I’ll make sure he reads this as well; but this raises some interesting questions.   

    We’ve covered numerous things in here, and I’ve done everything I can to avoid discussing the guarantees I feel like I’ve talked about to death (linked below) so addressing what we’ve discussed:

    • I’m seeing on average 20% of a customers data which merits deduping and of that I’m seeing anywhere from 10-20% space saved across that 20%
    • Translation: 100TB of data, 20TB is worth deduping reclaiming about 4TB of space in total; thus on this conservative estimate you’d get about 4-5% space saved!
    • Translation: When you have a 20TB data warehouse and you go to dedupe it (You won’t) you’ll see no space gained, with a 100% cost across it.
    • With the EMC Unified Storage Guarantee, that same 20TB data warehouse will be covered by the 20% more efficient guarantee (Well, EVERY data type is covered without caveat)   [It’s almost like it’s a shill, but it really bears repeating because frankly this is earth shattering and worth discussing with your TC or whoever]

    For more great information on EMC’s 20% Unified Storage Guarantee – check out these links (and other articles I’ve written on the subject as well!)

    EMC Unified Storage is 20% more efficient Guaranteed

    I won’t subject you to it, especially because it is over 7 minutes long, but here is a semi funny (my family does NOT find it funny!) video about EMCs Unified Storage Guarantee and making a comparison to NetApp’s Guarantee.   Various comments included in the description of the video – Don’t worry if you never watch it… I won’t hold it against you ;)

    Be safe out there, the data jungle is a vicious one!   If you need any help driving truth out of your EMC or NetApp folks feel free to reach out and I’ll do what I can :)

    SPOILERS!!!

    Read the rest of this entry »

    Tags: , , , , , ,
    Posted in Avamar, CLARiiON, Celerra, Deduplication, Efficiency, NAS, NetApp, SQL, Storage, Virtualization, emc, vSphere, vmware | Comments (View Comments)

    EMC Unified Storage Guarantees my speedometer will always be 20% faster!

    July 6th, 2010

    Hi, my name is Erik Estrada, you may remember me from such YouTube Video’s as Speeding Ticket w/Erik Estrada – EMC 20% Guarantee

    This is actually a particularly funny video – Any time a celebrity shows up in a video, it’s pretty funny. :)

    First thing I did in this video was deeply analyze it with loads of commentary! (You may notice I didn’t do that in my previous post, EMC Unified Storage Guarantee improves your Golf Game by 20% – Which is accurate because… Golf isn’t my thang so much, so improving my game by 20% isn’t saying much ;) that and it’s good to take a break from discussions)

    But this video in particular… it drew up a few questions to the top of my mind:

    • Is that really someone’s car with a CIO license plate in california?! (I considered calling it in to determine who owns that plate… but I didn’t.. ;))
    • Is Sarah Phillips intended to represent someone internally/externally? (Not sure, I couldn’t find anything which jumped out…)
    • Why is it she only handed pieces of paper?! Is that the new greener California License?! made of paper?! :)
    • Why isn’t Sarah wearing a seatbelt? (Yes, I realize this is staged… :))
    • If I got pulled over for 78 in a 65… well, let’s just say, I’m glad I work for EMC… ;)

    All in all though, it’s a fairly funny video which quite frankly… if I could get a 20% guarantee on my speed limit, I’d be ALL over that – Though with that being the case, I’d hope that everyone else on the road equally had a similar 20% guarantee or at least introduce some vehicles for us to get to FAST (Oh my god that was a terrible tie-in/joke! :))

    Tags: , , , , ,
    Posted in Storage, Unified Storage, emc | Comments (View Comments)

    EMC Unified Storage Guarantee improves your Golf Game by 20%

    July 6th, 2010

    I finally had the chance to check out EMC Unified Storage Guarantee video with PGA Pro Bill Andrade competing against Pro-Amateur EMC Executive Bill Scannell

    If you haven’t seen this yet, it’s pretty funny and covers things at a much higher level than all of the previous discussions I’ve had on this subject :)

    Tags: , , , , ,
    Posted in NetApp, Storage, Unified Storage, emc | Comments (View Comments)

    EMC Levels the playing field with Cloud based storage offering (ATMOS Online)

    July 6th, 2010

    I’ve been meaning to weigh in on this, but with the holiday weekend… well, you get the post-weekend weigh-in after everyone has had the chance to say everything good and bad on this (You’d be amazed at how little bad has been said on the subject.  I say how little because it’s ALL been re-iterations of the same register article)

    ….What happened?!?? I was just using ATMOS Yesterday and all of a sudden every competitor of EMC is telling me that ATMOS is DEAD and it’s time to close up shop?!? Whoa, hold on there pardner, that’s not what we’re saying at all! (Yes, that is exactly how *I* read it from the NUMEROUS competitive coverage of this event occurring and happening :))

    As far as you and I should be concerned, ATMOS hasn’t changed at all; except perhaps getting a little simpler in your choices of where to store ATMOS.

    Stolen from Chad Sakac's blog! :)

    Prior to this June 29th event, you would have the choice of ATMOS Hosted internally as part of your Private Cloud.   ATMOS Hosted externally at EMC’s ATMOS Online, or ATMOS Hosted externally at one of the three Service Providers listed below.

    Cloud Service Provider Service Description
    AT&T Synaptic Storage as a Service
    Hosted Solutions Stratus Cloud Storage
    PEER 1 Hosting CloudOne Storage

    Now… as far as Production Grade ATMOS Online goes; You can choose between Internally, and Externally with the Cloud Service Providers where you can act and operate against their existing SLA structure you may likely be familiar with.

    Looks pretty straight-forward if you ask me!   I sat down, read all of the data which was available to me, and I saw a strategic decision being taken in order to remove the competitive pressures between EMC’s own sales force, and that of the partners and service providers offering up the ATMOS Cloud Service to the general purpose community.   Or in the words of Dave Simpson who so eloquently said:

    The winners, of course, are EMC’s Atmos cloud storage partners, which no longer have to compete with EMC. Likewise, EMC no longer has to compete with those partners. It’s a win-win on that front. In fact, on its Atmos web site, EMC is “strongly encouraging” its existing Atmos customers to migrate to one of its partners.

    …..

    The shuttering of Atmos Online should be seen not as a failure on the part of EMC but, rather, as another prescient and tactical move by the company.

    And quite frankly Dave said it well; which I feel takes the whole debate about “where” should I host my Cloud Storage off the table; Discussion point over.   But what now?

    With the service providers discussion off the table, I think it is important to take the time to focus on customers and the community

    The EMC ATMOS Online Community is still strong and hearty.     Your vehicle for open discussion on this matter is still available and at your disposal.

    The ATMOS Online service can continued to be used for Development Purposes.

    Keeping your Cloud Private and doing your own internal testing has never been stronger with the release of the ATMOS Online Virtual Server Appliance (VSA)

    And last but not least – Customers DO continue to use this, and STILL will continue to use this.   This is best of breed Cloud Storage in a growing world which needs more versatile solutions at your disposal.    There will continue to be businesses like eBay who selected ATMOS to create a reliable, scalable cloud infrastructure.

    So in the end here; the real winners are YOU! EMC Customers, EMC Partners, EMC Service Providers.

    There’s never been a better time to play with ATMOS, and with the public release of the VSA, I’ve been seeing internal and external adoption grow at exponential rates; which means you’ll likely see a lot more perspectives and use-cases of EMC ATMOS which will far trump that of what has been seen up until this point.   More fun, more play, more winners! :)

    References also discussing this:

    Where to get your Cloud On:

    Tags: , , , , , , , , ,
    Posted in ATMOS, Cloud, Free, Storage, Virtualization, emc | Comments (View Comments)

    • Archives

    Google