The question “do you need a data scientist?” came up a lot when I was a management consultant for a global firm that successfully incubated data science within a few enterprise organizations. It’s hard. The discussion is hard and the culture clash for data scientists is hard. Many approach data science as some dark magic from Hogwarts. It’s not. Investigating a hypothesis takes time. Spontaneously generating data and building a model against that data doesn’t work. Understanding who you need and how they will fit into your organization is challenging. Where do we put them? Who do they interact with? What is the hand-off? Who do we structure around the project? How do you execute a project? Even better, how do we make MONEY? Yet, before we go there, perhaps we should step back a bit and think of this as a strategic question. Because maybe you do need a data scientist and maybe you don’t.
As momentum behind big data solutions builds, demand for a deep understanding of technologies like Apache Hadoop™ has become top of mind for our customers. One of the focal events for training and thought leadership around Apache Hadoop is the annual Hadoop Summit hosted by Hortonworks. As you may remember, we partnered with Hortonworks in the fourth quarter of 2012 to bring an on-demand Apache Hadoop product to the open cloud. Currently in private beta, the open cloud offering aims to provide customers with an easy platform to learn, develop and validate both the application and toolset.
This is a guest post written and contributed by Mike Prince, Founder at Corporation Wiki, a Rackspace Hybrid Cloud customer that archives historical corporate data to become the go-to spot for information on corporations and executives.
This is a guest post written and contributed by Ravi Chandran, Founder & CTO at XtremeData, Inc., a Rackspace Cloud Tools Partner. XtremeData provides a scalable, full-featured SQL data warehouse solution for Big Data analytics.
Data is at the heart of your business. As your business grows, your data grows. And with increasing amounts of data, you need a scalable, secure and efficient platform to store it. You also need reliability and performance in a storage platform that will grow with you. Today, we add a new solution to our Dedicated Storage portfolio that not only scales out easily, but does so simply and seamlessly: EMC Isilon.
We are living in the age of Big Data. There is so much information around us at any given moment that it is impossible to manually sift through and interpret it all. My team at Rackspace is responsible for the system engineering of our Cloud Databases product, and we collect 200,000 metrics every minute to understand the overall health of our system. Data visualization is more than a buzzword to me; my team depends on it to do our job effectively.