Part of my role here at EMC is to spend time with writers, bloggers and journalists on various topics. It's a global endeavor, often crossing language and cultural barriers.
Most of the time, it turns out pretty well. What ends up in print is a reasonable reflection of what I said - maybe not perfect, but close enough.
Once in a rare while, the result can get jumbled. When it does, I certainly share a decent amount of the responsibility: was I clear, did I speak slowly, did I make it simple enough?
A few days ago, a write-up of a recorded interview appeared in Storage Newsletter. I was dismayed that it had a bad case of that "lost in translation" problem.
I'm not quite sure how it happened. The questions were reasonable; I thought my answers reasonable -- but the result wasn't ideal. To make matters even more problematic, one of the industry bloggers pounced on one of the many misquotes as "evidence" of a broader set of issues.
Nothing goes unpunished on the social web.
So, in the interest of continual improvement, I thought I'd reprint the entire interview here, along with an improved transcription of what I said. The answers are the same, they're just a bit easier to read. I've also added some expansion points (clearly indicated) as needed.
The Interview
(I’ve omitted the long bio the writer included in the interest of brevity)
StorageNewsLetter: EMC was pushing ILM years ago, not anymore. Now it's big data. Who invented these words?
It was a guy back in 2004 who wrote an article about big data and the new value from information.
An analyst?
No, I think it was academic. They talked about the world of big data -- it first came from universities, then they grew up and found echoes in Wall Street, casinos, things like that.
We didn't start to call it big data until 2010, and then there was a series of articles from McKinsey, Harvard, The Economist. All at once, everyone was talking about big data. So as we got in 2010, 2011 it was about first the cloud, and everybody wanted to talk about it.
[Look at "Big Data" Dynamic Factor Models for Macroeconomic Measurement and Forecasting, by Francis X. Diebold, University of Pennsylvania and NEBR, July 2000]
What does 'big data' mean?
One view is to look at it as a new way to make money from information: We're all familiar with BI, with queries, spreadsheets. Here's this new business model -- people using information to make money by building predictive models -- crystal balls if you will.
It's a very different style of IT. Most IT is about saving money and reducing costs. Big data and big data analytics is about making money.
Additional note: in addition to commercial pursuits, big data is also transforming research, public policy and societal welfare -- indeed the world we live in. More can be found here.
What's the difference between data warehouse and big data analytics?
We're all familiar with data warehousing and BI. Typically we're using relatively small amounts of information. It's very clean, very standardized and you're usually asking about history.
Big data analytics is many data sources, some of them clean, some of them dirty -- and correlating them. The goal is to build a predictive model about the future. People who are good at big data analytics rarely come from the BI world, they tend to have a background in the sciences.
Why did EMC never enter into tape?
We saw tape as a flat, declining market. For the last ten to fifteen years tape market has been declining. At EMC, we work with tape, we interoperate with tape but again our investment strategy is to invest in growth markets so -- for example -- we were very early in flash, in virtualization, in security, in big data analytics.
If I remember well, StorageTek went to Sun, that didn't work out, it then went to Oracle, that didn't work out. We tend to invest in growing -- not shrinking -- markets.
These past years, we didn't see EMC inventing any disruptive technology like scale-out NAS or de-dupe being your more successful technologies in term of growth, but you're just getting them through acquisition (Isilon, Data Domain, Avamar). Is your R&D only updating current hardware and software?
Right now we track something close to four hundred startup companies, so we're always interested in new technologies. There is no way that we as EMC can be smarter than four hundred start-up companies. We're very comfortable with seeing a good idea, buying it and then adding it to EMC: Data Domain, Isilon, XtremIO, VMware, RSA, etc.
And -- at the same time -- we are doing substantial amounts of our own engineering. I look at VPLEX as a disruptive technology, w.g. cache coherence at a distance. Another example is Atmos, now three or four years old, it was the first commercial cloud scale-out object technology in the industry. I consider that disruptive.
But I'll agree with you, more disruptive technologies come to us through acquisition than organically. (note: VMware would be a good example) It's true for lots of big companies. We have our own technology that we engineer, but when you can buy good ideas on the marketplace, you also buy good people. It's kind of a mix of both.
Can we imagine an integration of VNXe and Isilon into a single platform?
If we did, we would end up with something similar to NetApp, and the market doesn't want that.
When we look at the VNX, it's very good at being one storage array - it has dual controllers -- I need some blocks, I need some files, maybe some snaps and similar. There is a great mid-market for that. These people don't get very big (note: capacity-wise).
When people start getting big they want scale-out. They don't want a product that's adapted to scale-out, they want a product that's built for scale-out. But you'll find common features in both.
For example the last Isilon release, known as Mavericks, you'll find some good snaps, good VMware integration, most of the stuff you'll find on the VNXe -- but for an Isilon I need a handful of nodes to get started.
Yes but these nodes could be VNX hardware.
Hardware is (becoming a) commodity, and I think we have to separate the hardware discussion from the software discussion. The hardware almost looks the same, Intel chips, etc.
When you look at what's NetApp doing with their last ONTAP 8.1.1, it's not a good scale-out system. We go head-to-head in competition, and when customers put the 8.1.1 next to Isilon, they want Isilon. Our belief is that when you get to things like scale-out or big data, you need an architecture that's built for it. It's very hard to add it on.
What will be the next storage killing application?
It's going to be big data analytics.
When will you have a dedicated all-flash array?
We announced our XtremIO acquisition this last year. We demonstrated it at VMworld and EMC World. It's an all-flash design. Native de-dupe, scale-out architecture, all flash. It fits very precise parts of the market.
Do you sell it?
No, right now we're in what's called 'early access'. Test it with the customers, make sure it works right. You'll probably see it in the first half of 2013 as a general product. (Note: this is now more like early 2H 2013 I am told) Remember: it's got a very specific profile: random IO/s, places where extreme performance is needed and tiering won't work -- a data set where everything has to run fast.
Will it be integrated as a cache with several tiers behind?
Nope, pure flash, no need for tiering. And because we have native de-dupe, the cost are pretty good. (Note: if someone wanted this, it would be either FAST as found in VMAX and VNX, the upcoming Project Thunder or perhaps VFcache).
Will it be possible to connect disk arrays to this platform?
No. For example, Isilon does scale-out for files. We need a scale-out block device for flash. That's where XtremIO fits. Now within flash we have tiers, MLC, SLC. So there's potential of tiering in the flash world but there's no disk arrays. (Note: VMAX supports the connection of external disk arrays, as do other EMC products such as VPLEX).
What's the percentage of your storage sales in virtualization environment only?
The challenge is that people are doing virtualization but they also have physical stuff. They bought it for VMware but they still have these older applications they put on the array. The percentage of our customers using virtualization is I guess over 80%. With Solaris and AIX virtualization it's probably at 98%. VMware, HyperV and Xen, -- yes – well over 80%.
You are not a big actor in the growing sector of HPC. Will you invest in this market?
We're starting, we are investing in this market, we see an opportunity to bring flash technology to HPC.
With the VNX or Symmetrix line?
Right now we're assembling things out of a VNX portfolio because it's closer to what HPC needs. But we have a lot of potential pieces, something like eight different boards, six different stacks, a lot of potential ingredients. Our current HPC solution is basically a flash-cache plus a VNX with a lot of flash in it. It's very good at check pointing.
And do you offer the Lustre platform?
We support it. We use mostly 10GbE but we're starting to do more work with Infiniband. Storage are becoming servers and I think in the next two years you'll see far more discussion about Infiniband not as a storage connect but as a memory to memory connect to storage.
Several competitors are doing better than EMC in backup software (CommVault, Symantec, Veeam, etc.) in term of growth, even if you invest in some companies (Dantz, Legato) in this sector in the past. What's the problem?
We see it as a backup market. Not as a backup software market. Let's take Data Domain: is that software or hardware? Well if you look at the hardware it's servers.
Yes but you don't have a product like CommVault Simpana.
Avamar is doing quite nicely, Networker hangs in there. We tend to look the market as the data protection market. If you want to cut it into software we do all right with our backup software products.
But what our customers tell us is that they want a backup solution. They want the services, the platform, the consulting, they want all of that. So we tend to look as the market a little bit differently.
What's your current acquisition strategy?
We have publicly announced that we're very interested in investing in next-generation security. In the storage world we're pretty good, there's no space in storage where we think we're underexposed. There's a couple little pieces that are interesting but if you look at our storage portfolio we got two, three flavors of blocks, two flavors of files, object, three management stacks, we've got data protection, etc.
You lost market share vs. NetApp for last known fiscal quarter (storage sales down 1.4% sequentially for EMC, up 2% for NetApp). What's your explanation?
(note – I have no idea where he got those numbers – here are the IDC numbers that most people use – which show the complete opposite – so I try and be diplomatic)
We're in a lot of different markets, high-end enterprise where they're not, consumers with Iomega ...
Yes but they are growing faster in storage sectors than you are.
On the product side, NetApp growth is flat. I think we're showing pretty good growth. But keep in mind we have many storage segments. Some are going quite fast like scale-out NAS, growing incredibly fast. One of the opportunities when you have a portfolio is that some things are hot, some things are less so.
What's your opinion on the more recent acquisition of Cache IQ by NetApp?
NetApp is under pressure to expand their market. If you go to Wall Street they'll ask "Where is the growth going to come from?" It's an interesting acquisition, as very few NetApp acquisitions have worked out well. But the question for me in on the NetApp side, less about the technology.
Why did you sell the majority of ownership of Iomega to Chinese Lenovo?
We're very interested in the Chinese market, and it makes sense to partner your way into the Chinese market. Lenovo has a strategy to grow from the bottom up into the storage business. We saw it as a two-fold. One: it's a great way to partner our way to the Chinese market. And second, most of the consumer electronics comes from China. It’s a strategic play but also a business Lenovo wants to invest in: getting better into the Iomega range.
Is EMC going to continue selling Iomega products?
Perhaps the higher-end. They also do consumer stuff, we're not so interested in that. But for example they do a very nice single controller.
What's your reaction if I tell you that the best quality of EMC CEO Joe Tucci is to recruit around him at high price excellent executives?
Joe has done a amazing job in building an executive team. I work with all these guys -- it's like a dream team. Do you want a cheap team or do you want a good team? And they work together as a team. I get to work with some of the best executives in the industry.
I don't think there is such a thing as cheap leadership -- we're in a very dynamic part of the industry, very competitive and changing very fast. This is not a stable, predictable industry. And we're not just storage. A lot of our executives came from other companies and we're glad to have them. Certain vendors out there are not doing so well.
What's your bet concerning the successor of Tucci as CEO?
Any way we win, there are so many good leaders there that I think our stockholders, our customers, our employees win for any of these executives choices. No bet from me. Do I have my own opinion? Yes. Does it matter? No. We've got six or seven great guys here, any one of them could be the next CEO and we'd be good. Dave Goulden, Pat Gelsinger, Howard Elias, Jeremy Burton, Paul Maritz, every one could be the next guy.
You were at EMC when Moshe Yanai, Mr. Symmetrix, was there? Why did he quit as he appears to be later the founder of remarkable start-ups in storage technology?
EMC was growing from a single product company to a multiple product company. Some people are comfortable with that, some people aren't. As soon as they were multiple horses, Moshe was not so happy. He didn't like the acquisitions but we had a situation where we, as a company, had to build a portfolio -- we couldn't be a single product company anymore. He always liked SRDF that was his baby so it's no surprise he's involved with Axxana.
(note: the interviewer brought up Axxana, I didn't)
What are more precisely your relation with Axxana?
(Note: I believe they might be in EMC Select). We have a program called EMC Select where we will sell on the same purchase order third-party products. We don't directly support it, we don't engineer it, but in creating a solution with EMC product and non-EMC product, like tape. It's a nice value add on top of SRDF with little extra protection.
------
Final Thoughts On This Experience
Part of the opportunity – and challenge – of being a public spokesperson is that you’re not always in control of what gets written. And, of course, there’s no “undo” on the web.
That being said, I hope you don’t mind me taking a few minutes to clean up the interview, and better share what the discussion was about, make it perhaps a bit more readable.
And I’m looking forward to my next interview with Storage Newsletter :)

This makes me rather sad, I thought being able to use xtremeio as a datacenter wide cross array cache would be a massive jump forward.
I was envisioning EMC having a solution with vplex combining with fast sub lun tiering. Then add in VNX, Isislon & VMAX all working with vplex (Isilon may not work well, but both Isilon & Xtreme using inifiniband not impossible); rather than having large islands of SSD cache within individual arrays imagine a centralized global SSD cache. I wouldn't expect to completely remove SSD from the arrays, but reduce the level in them. It's easier for me to make an purchase argument for expensive storage that could be used datacenter wide across NAS/SAN than just one array.
As we all have differing workloads depending upon time of year, the performance of my NAS array is as or more important to me at times than my VMAX when certain processes are running. I was really digging the thought of the infrastructure automatically responding to workload for the entire datacenter without me having to micromanage luns... it was a very, very powerful thought.
Posted by: InsaneGeek | December 07, 2012 at 08:18 AM
Insane Geek
Then you'll be very interested in upcoming announcements during 2013 along these lines. You can see a preview if you search here (or elsewhere) for Project Thunder.
-- Chuck
Posted by: Chuck Hollis | December 07, 2012 at 09:12 AM
I've already had the pitch from "Cache Gordon" for Thunder, and it looks very interesting but it doesn't really help across the board. To my knowledge it's still per server and only helps with block clients not NAS clients. I'm thinking more at the SAN fabric layer (i.e. local VPLEX) and then using NAS gateways that could use that SSD space as well.
If Thunder is looking to go beyond directly to the server that could be interesting indeed.
Posted by: Insane Geek | December 07, 2012 at 09:50 AM
Insane Geek
Ahh, I see what you're after -- sort of a "cache abstraction layer" that could serve all presentation models as a shared and manageable pool, and -- of course -- be independent of array type, etc.
It's an interesting thought, to be sure.
But, as you know, flash is all about speed, and the closer to the server you are, the better. We're getting into serious debates around microseconds already.
The other consideration is that we would now be putting FOUR different flash architectural options in front of our customers: in the server, shared server appliance, your proposal, and -- of course -- flash in the array itself. That would make for a heckuva lot of puzzled looks all around :)
But it's an interesting idea nonetheless -- cache acceleration as a basic shared storage service -- so I'll see if I can go shop it around a bit.
-- Chuck
Posted by: Chuck Hollis | December 07, 2012 at 10:09 AM
Hi Chuck,
Eli Efrat here, CEO of Axxana.
Yes, we are a proud member of EMC Select.
It is truly amazing - I went through the EXACT same experience with StorageNewsletter just last week, with a Q&A they did with me about Axxana. One sentence you wrote: "I'm not quite sure how it happened. The questions were reasonable; I thought my answers reasonable -- but the result wasn't ideal.", was as if you took the words out my mouth last week...
Eli
Posted by: Eli Efrat | December 07, 2012 at 02:05 PM
And... I too am looking forward to my next interview with Storage Newsletter :)
Posted by: Eli Efrat | December 07, 2012 at 02:07 PM
Hi Everyone
Jean-Jacques Malevel (the author of Storage Newsletter) attempted to respond to what I said here, but had difficulties in posting his comments.
I suggested he email them to me, and I'd post them for him. So here it is:
----------
Chuck,
First, you are contesting my figures on the storage business of EMC and NetApp in the interview when I asked you the following question last November 15: "You lost market share vs. NetApp for last known fiscal quarter (storage sales down 1.4% sequentially for EMC, up 2% for NetApp). What's your explanation?" "I have no idea where he got those numbers," you write on your blog.
Here it is:
For EMC in fiscal 3Q12: Quarterly sales for storage products reached $3,763 million down 1.4% sequentially. Look at your quarterly report and also at our news: http://www.storagenewsletter.com/news/financial/emc-fiscal-3q12-financial-results
For NetApp in fiscal 3Q12: Storage products revenue was $996 million, down 2% from 2Q2012. Look also at http://www.storagenewsletter.com/news/financial/netapp-fiscal-2q13-financial-results
On your side, for the demonstration, you publish the figures of IDC on your blog, but mine are the OFFICIAL figures of the two companies for their last financial quarter.
To prove your comment, you write: "Here are the IDC numbers that most people use - which show the complete opposite - so I try and be diplomatic".
You publish some figures of IDC for 2Q12 for the worldwide external storage market enterprise as my question was about the quarterly results from 2Q12 to 3Q12. Look at the right ones, the IDC numbers just published for 3Q12 for the same market: EMC: -2.2% from 2Q12 to 3Q12.
More than that, you have changed many sentences in the interview totally republished on your blog. We usually transcript accurately the answers to questions, just removing some parts without interest, for example when you don't know the answer to a question as it was the case. Between quotation marks, journalists do not change what you say. We will be happy to send you the original audio recording of the interview. That's not our fault if you have not appreciated what you told us on the faithful transcript - even if can happen that we did some few mistakes. We don't "clean up the interview" to "make it perhaps a bit more readable" as you wrote and did in your blog.
It's the first time I encounter during my long career in the IT press such a problem following an interview. And I have published 133 of them with storage executives - mainly with CEOs - since the inception of StorageNewsletter, on paper and then on Internet.
Please add these comments to your article "Lost In Translation?" on your blog before next Monday.
-----------
Jean-Jacques: here it is. My response is in the next comment.
Thanks!
Posted by: Chuck Hollis | December 08, 2012 at 09:28 AM
Hi JJM
I can see by your comments you're frustrated and upset. That's not my goal here.
Let's take it point by point.
First, I think you're confusing "storage revenue" with "storage market share". They are different concepts.
For example, in a declining economy, it is possible to both have declining revenues and increasing market share.
You asked a question about storage market share, so that's what I answered. If you were to ask a question about storage revenues, you would have gotten a different answer.
Going farther, every vendor defines their "storage revenue" a bit differently -- do you include software, rentals, services or not?
IDC does a good job of normalizing different interpretations so we all can compare apples-to-apples. We all use the IDC numbers for that very reason.
Since this interview, IDC has just published more recent numbers (yesterday?), which I haven't had a chance to go look at.
Regarding the transcription -- I found multiple errors that changed the meaning. For example, on the weekend, I don't "lay in bars" -- I play in them, as a musician.
I did provide the link back to your original transcription so readers can read both, and come to their own conclusions.
I, too, have been interviewed hundreds of times by writers around the globe, and this is the first time I've had this experience as well.
As I said before, I thought the questions and answers were good -- but something seemed to get lost in translation. I don't think there was any malice or ill-will on behalf of either party.
And I was quite sincere when I said I was looking forward to our next interview.
Best regards
-- Chuck
Posted by: Chuck Hollis | December 08, 2012 at 09:45 AM
Internet is a beautiful media, it can ruin a perfectly fine Sunday just with a blog post telling you that you did crappy work.
Let me introduce myself. My name is Corentin Béchade, I'm a young editor at Storage Newsletter and the one responsible for this highly debated transcription (For the Axxana one too if Eli reads me)
Don't get me wrong after being corrected by Eli and now by you Chuck I'm probably the faulty one. As you said Chuck we're all seeking continual improvement and I seemed to have learned that the hard way. But there is a few things I would like to get straight.
The interview conducted by Jean-Jacques was interesting to listen to because it was full of what I would call "raw material", that is, the honesty and the natural language between an interviewer and an interviewee.
I think I can speak for JJ and me when I say that that is what we like in an interview. This is why I transcribed and not "translate" it as it was, with no word replacement or politically correct sentences. We are not in the marketing field, we work as journalist, our goal is to transcribe as accurately as possible what is being said.
I admit gladly that my transcription was maybe not perfect, I'm still working to improve my skills, I listened to that interview several time to be sure I wasn't missing anything. I also put our transcript side to side with yours and most of the work is pretty much the same. Telling that things got lost in translation is a bit exaggerated. But when you decide to change or to remove parts of what you said, that's when things start to get complicated.
Nevertheless I admit I can make some mistakes, I'm not bulletproof, but the better solution is to send us an e-mail with corrections that we can look at, not to rewrite what was being said. Because simply there's no other way for us but to look at this and think of it as a personal attack.
We are open to criticism but when you start by saying that the whole interview is badly translated it puts your words against ours and create this whole mess. It's not good publicity for you or us
Sincerly,
Corentin Béchade
Posted by: Corentin Béchade | December 09, 2012 at 09:19 AM
Hello Corentin
Thank you for providing additional color and detail from your perspective. I'm sorry, I didn't mean to ruin your Sunday.
You are right, I could have created a lengthy email to you and your colleague, going line-by-line with corrections and clarifications, and hoped you would have eventually made the edits.
I also had the option of publishing the edits via my blog, as I did.
Let's not forget, there's another potential option here as well -- you could have sent me a rough draft of your transcription prior to publishing on the web, and we could have fixed it then and there.
As you know, once something is published on the web, there's no "undo" key.
If you go back and read the above post, I made it clear that I was not trying to assign blame -- maybe I spoke too fast, maybe I was not clear, maybe I should have kept it much more simple.
Regardless of fault, the end result was not ideal, and had several inaccuracies. I'm sorry if you and JJM see this as a personal attack; that was not my goal in the least.
I had my own less-than-fun day after you published.
For example, I had many people reaching out to me, asking "What happened in that interview? That doesn't sound at all like you ...".
And they were right, it didn't sound like me at all -- neither words nor context.
Several internal EMC people contacted me to say that I had made mistakes in the interview. I had to patiently tell each and every one of them that I had been misquoted.
And a certain industry blogger decided to pounce on one of the misquotes as "evidence" that one of our product programs was in serious trouble, which is not the case.
As I shared with JJM, I provided a link back to the original interview you published, or -- at least -- what ever form it might be in now if you've edited it since. Readers are free to compare, and draw their own judgments.
If you're looking for that live, conversational feel, perhaps the best medium is a podcast or video interview. I do a lot of those, and they come out pretty well.
As I've mentioned before, I look forward to my next interview with Storage Newsletter!
-- Chuck
Posted by: Chuck Hollis | December 09, 2012 at 02:09 PM
not to beat a dead horse but this new blog post may help clear up the revenue versus market share discussion
http://storagegaga.com/is-there-no-one-to-challenge-emc/
Posted by: jonas | December 10, 2012 at 06:13 PM