Big Data, Hadoop Standards Group: Who's In, Who's Missing?
by
FEB 19, 2015 1:12pm ET
First, the big picture. The Open Data Platform will "promote big data technologies based on open source software from the Apache Hadoop ecosystem and optimize testing among and across the ecosystem’s vendors. These efforts will accelerate the ability of enterprises to build or implement data-driven applications," according to a statement from the association's founders.
In decades past, similar standards groups have united to promote Linux, Unix, WiFi and other emerging platform technologies. But standards groups can also resemble the political landscape -- as vendors sometimes break off and move to the extreme left or right of the group's stated goals.
Big Names to Start
Several industry giants and startups are driving the Open Data Platform group -- including Altiscale, Capgemini, CenturyLink, EMC, GE, Hortonworks, IBM, Infosys, Pivotal, SAS, Splunk, Teradata Verizon and VMware.Still, some key names also are missing from effort. Chief among them:
- Cloud services providers like Amazon Web Services, Google Cloud Platform, Microsoft Azure and Rackspace -- each of which promotes various Hadoop efforts on the public cloud.
- Hadoop specialists like Cloudera (which now has a $100 million annual revenue run rate) and rival MapR -- both of which compete with Hortonworks.
- Hardware providers that ship servers and tune their systems for Hadoop -- including HP, Dell and others.
- NoSQL database providers that work closely with the Hadoop industry -- such MongoDB and others.
Eight Core Goals
Despite those points of debate, the Open Data Platform's powerful members should be able to flex some muscle in the weeks and months ahead. The existing members say they have eight core goals:- Accelerate the delivery of big data solutions by providing a well-defined core platform to target.
- Define, integrate, test and certify a standard "ODP Core" of compatible versions of select big data open source projects. This area, Information Management believes, could be particularly tricky as vendors potentially try to promote their wares into the standards-based platform.
- Provide a stable base against which big data solutions providers can qualify solutions.
- Produce a set of tools and methods that enable members to create and test differentiated offerings based on the ODP core.
- Reinforce the role of the Apache Software Foundation (ASF) in the development and governance of upstream projects. This is particularly important, Information Management believes, since Apache has been so instrumental in Hadoop's development so far.
- Contribute to ASF projects in accordance with ASF processes and Intellectual Property guidelines. Here again, Information Management believes the group is trying to stress open, vendor-neutral collaboration through ASF.
- Support community development and outreach activities that accelerate the rollout of modern data architectures that leverage Apache Hadoop.
- Will help minimize the fragmentation and duplication of effort within the industry.
But on the flip side, there are numerous examples of open standards -- HTTP, Ethernet, WiFi, etc. -- that ultimately benefitted both vendors and their customers. Open Data Platform certainly hopes its efforts mirror those hugely successful outcomes.
No comments:
Post a Comment