« Enterprise Flash Drives -- An Update | Main | Adventures In Server Marketing »

August 01, 2008

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/services/trackback/6a00d83451be8f69e200e553c8acbb8833

Listed below are links to weblogs that reference Adventures In Storage Marketing:

Comments

Feed You can follow this conversation by subscribing to the comment feed for this post.

Dean

First let me state that my employer is a customer of both EMC and Netapp, along with HDS and IBM. I have no affliation with any vendor, in any way.

Having said that, we are probably not representative of the particular type of company that this post is aimed at - companies who might be considering moving off direct attached SAN storage to Netapp V-series NAS heads.

We have already done that, and it was an incredibly beneficial thing. The company's entire Windows file serving environment was moved from a SAN attached Windows clustered file server to a clustered Netapp V-series. The back-end is IBM DS8100, but that's neither here nor there really.

Other than CIFS file serving, the rest of our critical apps are still FC SAN, and we are not considering changing that.

I'm not here to hype Netapp's gear, but I just wanted to respond to some of your points, from an "already sold" customer's perspective :

"Question #1 -- What's Involved In Installing One Of These?"
Yes, it was a long, drawn out process moving from the Windows file server cluster to the Netapp units, but it was definitely worth the effort.

"Question #2 -- Does My EMC Software Work With This?"
I don't know. I can't see how it's relevant.

"Question #3 -- Does EMC Support This?"
Again, not really relevant in my case. I have Netapp NAS heads in front of IBM disk. But IBM certainly support their disk system, and Netapp support their NAS heads. It's all working out pretty well so far. The NAS head (V-series) is really just another server, as far as I'm concerned, and I would expect my disk vendor to see it the same way. If EMC won't support their disk just because there's a Netapp box accessing it, then... yeah, that's a problem. The question should probably be "Does Netapp support the V-series attaching to EMC disk", and I'm pretty sure they do.

"Question #4 -- Are There Any Performance Impacts?"
Of course. They are fairly frank about it in their FAQ ...
8. IS NETAPP DEDUPLICATION SUITABLE FOR PRIMARY STORAGE?
Yes for “light duty” primary applications. What we mean by light-duty primary storage are volumes that contain primary (1st copy) data, but are not performance-driven. Some examples of this would be VMware VM’s, home directories, document directories, and application volumes that experience heavy I/O loads during the day but are quiescent at night and on weekends. These volumes might very well benefit from deduplication if the system has the performance headroom to support the additional overhead imposed by deduplication.
10. IS THERE ANY WRITE PERFORMANCE OVERHEAD AFTER ENABLING DEDUPLICATION ON A VOLUME?
As data is stored on a deduplication-enabled volume, digital fingerprint files are also stored. Less than 10% write performance overhead is required for this process.
11. IS THERE ANY READ PERFORMANCE OVERHEAD AFTER A VOLUME IS DEDUPLICATED?
When data is read from a deduplication-enabled volume, the read performance penalty will vary depending upon the original vs. deduplicated block layout. Unless the data has been written sequentially, the read impact would be minimal. However, if an application depends upon fast read performance i.e. sequential block recording, deduplication’s impact on read performance should be carefully considered before implementation.
12. CAN THE SYSTEM PERFORM OTHER OPERATIONS WHILE DEDUPLICATION IS RUNNING?
NetApp deduplication runs as a background process and the system can perform any other operation during this process.

"I'm sure NetApp did extensive performance testing about the before-and-after impacts of their proposed solution, and they just forgot to share. Looking forward to seeing this soon, folks ..."

Well, the points above from their FAQ are much more than I usually see from most other storage vendors.


Question #5 -- Do You Have A Process or Tool For Measuring The Potential Savings?
Again, from the FAQ :
19. CAN I ESTIMATE MY SPACE SAVINGS BEFORE INSTALLING DEDUPLICATION?
Yes. A space savings estimation tool (SSET) is available to NetApp and Partner SE’s.

Question #6 -- And, Ultimately, What's The ROI?
Well, for a company like the one I work for who is already using V-series, it is almost a no-brainer. It is apparently a no cost license key I have to type in once, then I simply setup a schedule to do the deduplication overnight. If it reduces our NAS usage of raw, tier 1, FC attached enterprise disk from 11TB to 8 or 9TB, it is most certainly worth it.

Would I migrate from some other file serving environment to Netapp just for this feature? Probably not - it would depend on the circumstances. But, being someone who has already done that migration, I see this as a VERY good thing. I will just wait until it's been in the field for a little while before implementing.

One of the major pain points with Netapp is the overhead required to support their filesystem and the features that go along with it (like snapshots). We have over 11TB of raw disk attached to the NAS heads, but there is only about 8.5TB of usable storage. This deduplication feature can potentially offset some of that pain.

By the way, the FAQ is here. http://communities.netapp.com/servlet/JiveServlet/downloadBody/1060-102-3-1327/FAQ%20NetApp%20Deduplication%2006_11_08.pdf
I think you need an account with the Netapp support site to access it.

Regards
Dean

the storage anarchist

Like Chuck, I've been accused of Not Having a Clue about NetApp, so please - educate me as well...

How many of the V-Series gateways will be needed to front-end a 100TB usable capacity Symmetrix DMX-4 serving 10,000 LUNs to 2500 hosts over 48 4Gb Fibre Channel ports (dual-pathed as Chuck described) for 100% database applications (a mixture of Oracle, Exchange, SQL Server, etc.) with typical 90-95% capacity allocation?

And if these are all using another 30TB usable for TimeFindar/Snap for point-in-time snapshots (which are used for nightly backups to a separate de-duped disk library), how much more or less capacity will be required AFTER the introduction of these V-Series without loss of performance or usable capacity?

Enquiring minds wanna know!

Chuck Hollis

Hi Dean, thanks for your comments -- they're very helpful.

The use case you describe is neither the use case I describe, nor the one promoted by NetApp. It appears you used a V-Series filer to replace a Windows filershare (e.g. NAS).

NetApp's announcement was around block storage applications (e.g. SAN). And they're very different. As you point out, your critical apps are on the FC SAN, and you're not contemplating changing that.

Thanks for attempting to answer the questions, although it was for a very different use case. A few thoughts:

Question #1 -- How Much Effort -- you shared that it was a very lengthy process. Glad it was worth the effort for the NAS move.

Question #2 -- Does My EMC Software Work -- obviously, there were no EMC products involved in your NAS environment, so of course it's an irrelvant question. But if you were using any of the software that is often used with the DS8100, then you'd have a similar problem. Obviously, this was not the case.

Questions #3 -- Does EMC Support This -- Again, the supprot model you describe might be acceptable for a NAS model, after all , it's just a server, as you describe. But I also think it's fair to point out that many customers would find issues with this support model were they considering a SAN use case.

Question #4 -- Are There Any Performance Impacts -- thanks for sharing the NetApp guidance, and I agree, it's better than nothing. However, almost none of it would apply to a SAN use case, and some of the advice appears either incorrect, or insufficient: e.g. there are some VM images that do very heavy work.

Question #5 -- How Much Capacity Will I Save -- didn't know that NetApp had a tool -- I wonder how well it works on block-oriented data such as Oracle databases, Exchange environments et. al. It'd be interesting to find out.

Question #6 -- Is It Worth It -- OK, I agree, if the V-Series is "free", already installed, and all I have to do is type a license key, and there weren't any performance problems -- you're absolutely right, it's a no brainer. But's that not what the NetApp guys were offering, right?

And going from 11TB to 8 or 9 is nice, but wouldn't have paid for the devices, the effort, etc. -- kind of illustrating my point. And I'd be very curious if there was another way of getting that reduction, e.g. asking people to clean up a bit.

The overhead of OnTap and WAFL are legendary. For some reason, the NetApp guys don't like to talk about it much. But you're right, it's part of the equation, isn't it?

Thank you so much for commenting and sharing!


Robert D

Hey Chuck,
I could be mistaken but I don't think Netapp dedupe works with Netapp SAN.

The LUNS presented by the Sym, DMX or CX to a V Series are written with Netapps NAS file system. LUN's in the netapp world are really just Files on top of that.

(You can prove this out by putting a CIFS share on the root of a netapp volume. You'll see the .LUN file and you can write a "DEAR_GRANDMA.DOC file right next to it in the file system)

Anyway, I digress.
This LUN as a file means that dedupe occurs at the NAS level and not the SAN level for Netapp.

With all of the performance limitations associated with that.


Scott Harney

I think the use case Dean is talking about is more typical. I don't think anyone is seriously talking about fronting DMX-4 flash drives with a netapp v-series ;)

More here:
http://searchstorage.techtarget.com/news/article/0,289142,sid5_gci1323253,00.html?track=sy60

"While it's still not likely to be used with a new 3PAR InServ or EMC Symmetrix, "If you have a lot of other legacy storage systems or more conventional storage systems, you can use V-Series with dedupe to repurpose them," said Forrester Research analyst Stephanie Balaouras.

"This won't be a flagship piece of our portfolio -- just another option for customers," Cummings admitted. "

Chuck Hollis

Very much agreed, Scott.

And it does make sense (as I pointed out in the post) that -- sure -- if someone has some older tin that needs a new life, that makes sense.

But that's not how it was presented by them, and there's my beef in a nutshell!

Chuck Hollis

Hi Robert

I reviewed their announcement materials many times, and the implication was clear -- they positioned it as dedupe for SAN with products such as Symmetrix and CLARiiON, which are -- obviously -- SAN products.

Now, whether they actually intendeded people to take them at their word or not is debatable, or whether it'd actually work reasonably enough, and so on.

We shouldn't have to be second-guessing vendors' announcements to this degree. Struck me as immature, it did.

Jonathan

I don't quite understand the fascination of LUN's on filesystems and vice-versa over here. I'm more of an empirical guy.

In my own personal expereinces, I've seen V-Series front-ending several back-end arrays with increased performance. It seems their optimized virtual block layout and integrated read/write cache makes the most out of disk-limited configurations by involving every spindle in all the aggregated I/O's tossed at it.

Add in thin provisioning, fast snaps and optional iSCSI or NAS gateway functionality and I can see the attraction. Heck, dedup is just gravy at this point.

Chuck Hollis

The performance effect you're seeing basically results from spreading the load over available spindles, which works well in some situations.

This is also done by many volume managers (many free!), certain file systems, within the array itself, etc. -- nothing unique to the V-Series.

Reinforces my suspicions on how this product is marketed and sold ... thanks!

Chuck Hollis

I've been scanning the blogs at NetApp, hoping for some sort of answers to some of the questions posed here.

Nothing so far, really. Not surprising.

However, I did see this scathing missive from Val at NetApp, proving -- once again -- blogging is best done when sober.

http://blogs.netapp.com/exposed/2008/08/before-flattery.html

Val -- I know you're trying to help NetApp's cause here, but I don't think this sort of thing is what the corporate PR types are looking for ...

Val Bercovici

Thanks for the cross-link Chuck!

I’m quite pleased this “scathing missive” didn't disappoint. When you swing the rhetorical pendulum that far, don’t be surprised when it swings away as far in search of facts so that it can obtain a natural balance.

As for corp PR folks, what did yours say when Curtis Preston had to recently correct your excessive hyperbole twice regarding your creative interpretation of EMC’s primary dedupe offerings? By EMC’s standards, I’m sure no ethical lines were crossed there either!

Finally, I guess you could say I subscribe to the Dr. Johnny Fever school of blogging productivity – better when pleasantly inebriated :)

Chuck Hollis

I apologize to all my readers, it appears that "Jonathon" didn't really want to offer up a real email address, nor a real website address. I usually catch that sort of stuff before it makes it here ... again, my apologies.

Chuck Hollis

My, you meet all sorts of people on the internet, don't you?

Steven Schwartz - The SAN Technologist

In some of my reading I came across this. Someone forgot to point out that IBM sells NetApp appliances, so I'm sure there is support there.

As for "virtualization" appliances, there are many, everyone I've seen introduces latency and a performance overhead, physics are at work here. However, many people are willing to take a performance hit for increased functionality or some other trade-off.

I cannot believe that Deduplication will be enough of a reason for making a V-Filer implementation on top on EMC or any other storage.

As for performance? Test it! Take the same FC LUN from a DMX and run a benchmark, take the same volume presented through a V-Filer and then presented out as a FC LUN and I'd love to know the difference. If anyone reads this and does it please report back!

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been posted. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment

Chuck Hollis


  • Chuck Hollis
    VP -- Global Marketing CTO
    EMC Corporation

    Chuck has been with EMC for 16 years, most of them pretty good.

    He enjoys speaking to customer and industry audiences about a variety of technology topics, and -- of course -- enjoys blogging.

    He lives in Holliston, MA with his wife, three kids and three dogs when he's not travelling. Chuck enjoys piano, mountain biking, boating and skiing -- in that order.

    Warning: do not buy him a drink when there is a piano nearby.

My Service Provider Blog

General Housekeeping

  • Frequency of Updates
    I try and write something new 1-2 times per week; less if I'm travelling, more if I'm in the office. Hopefully you'll find the frequency about right!
  • Comments and Feedback
    I'm going to be approving comments before they get posted here. Any information you can share about who you are, how to contact you, what you do for a living, etc. would very much be appreciated.

Twitter Updates

    follow me on Twitter