Clever Geek Handbook
📜 ⬆️ ⬇️

Cloudera

Cloudera is an American company, a developer of Apache Hadoop distributions and a number of Hadoop ecosystem software products.

Cloudera, Inc.
Cloudera logo v2.png
Type ofPublic company
Listing on the exchange
Base2009
FoundersChristophe Bishilla,
Amr Avadallah,
Jeffrey Hammerbacher,
Michael Olson
Location USA : Palo Alto
Key figuresReilly, Tom (CEO),
Doug Cutting (Chief Architect)
Industrysoftware development ( ISIC :6201 )
ProductsCommercial version of Hadoop ,
Turnover▲ $ 301 million (2018)
Operating profit▼ - $ 389 million (loss, 2018)
Net profit▼ - $ 386 million (loss, 2018)
Capitalization$ 2.66 billion (September 7, 2018) [1]
Sitecloudera.com

The business model of the company is compared with the Red Hat business - Cloudera creates software distribution kits for organizations based on free software and extracts profits by providing technical support for the supplied solutions [2] [3] . With the “ big data ” technology boom, Cloudera has been repeatedly recognized as one of the most promising companies capable of solving problems of the corresponding class [4] [5] .

In 2018, it absorbed its main competitor in the market for Hadoop-distributions - the American company .

History

The company was founded in October 2008 in Burlingame ( California ) with a starting capital of $ 5 million, the main goal of the business is the commercialization of the Hadoop project. The founders of the company are Christophe Bishilla ( Eng. Chirstophe Bischiglia ), previously working at Google , Amr Avadalla ( Amr Awadallah , vice president of Yahoo , responsible for analysis and data warehouse), Jeffrey Hammerbacher ( Jeff Hammerbacher , Hive project manager at Facebook ) and Michael Olson , Vice-President of Oracle Corporation, previously CEO of Sleepecat , who developed and developed Berkeley DB and was absorbed by Oracle in 2006) [6] . Hammerbacher organized the initial funding of the project by Accel Partners , and Olson led the company. In total, at the initial stage, $ 11 million were attracted, and in addition to Accel among investors, Greylock Partners and business angels Gideon Yu and Caterina Fake are indicated [7] .

Hadoop creators Doug Cutting ( eng. Doug Cutting ) and Michael Cafarella ( Mike Cafarella ), former heads of VMware (Diane Green, Diane Green ) and MySQL AB ( Marten Mikos ) [8] were among the employees hired in the first months. Due to the fact of Cutting's transfer to Cloudera, the company was characterized as a “new standard-bearer Hadoop” [9] .

In 2009, Bishilia was ranked fifth in the list of the 22 best young technological entrepreneurs of the Businessweek weekly [10] , and Hammerbacher was placed on the seventh (out of 15) position in 2010 . When nominated by Bishilia, Cloudera was characterized as a service company providing technical advice on Hadoop, while Hammerbacher’s contribution in 2010 was marked as a transformation of the company's business, making it a supplier of replicable software for organizations [11] .

In November 2011, the company received additional financing in the amount of $ 40 million [12] , in December 2012 - another $ 65 million [13] , among investors of the next rounds are Ignition Partners , Greylock , Accel , Meritech Capital Partners and In-Q-Tel [ 14] [13] .

In October 2012, the company introduced the Impala product., which provides SQL access to data in a cluster running Hadoop, the appearance of such a product was met as a surprise, since the prevailing rhetoric of companies focused on “big data” technologies was the abandonment of traditional technologies based on SQL ( English Old SQL , in consonance with the “ old school ” - old school ) [15] .

In June 2013, Tom Reilly was invited to the position of CEO , who previously led two technology companies to be taken over by major players (Trigo, an MDM system manufacturer, was acquired by IBM in 2004, and was launched at IPO and soon absorbed by Hewlett-Packard in 2010), the event is rated as preparation for either initial placement or the sale of a business [16] . Olson moved to the post of strategic director and chairman of the board. In July 2013, the firm absorbed the British company Myrryx , founded by Sean Owen, one of the main authors of the scalable machine learning framework which is part of the Hadoop ecosystem , announced the appointment of Owen to the position of “Director of Data Science ” ( eng. director of data science ) [17] .

By mid-2013, for five rounds of investment, the company received a total of $ 141 million [16] , and in the next round in March 2014, the company raised another $ 160 million [18] . In March 2014, after the sixth round of investment, Intel acquired an 18% stake in the company for $ 740 million, thus valuing the Cloudera business at approximately $ 4 billion [19] ; at the same time, Intel rejected the development of its own Hadoop distribution a year earlier in favor of promoting solutions from Cloudera [18] . In June 2014, the company acquired the Gazzang data encryption technology developer company [20] .

In April 2017, the company conducted an initial public offering on the New York Stock Exchange , as a result of which it raised $ 215 million [21] . In the fall of 2017, the New York firm-developer of machine learning algorithms Fast Forward Labs was absorbed, the deal was noted as a response to the close integration of Hortonworks with IBM, which focuses on the development of artificial intelligence systems under the Watson program, and abandoned its Hadoop distribution in favor of Hortonworks [22] .

In October 2018, a merger with Hortonworks was announced, while the structure retained the name Cloudera, listing on the stock exchange and the general director, and Hortonworks shareholders received 40% of the shares of the combined company [23] . The transaction was completed on January 3, 2019, despite the total valuation of the two companies at the time of the announcement of $ 5.2 billion, at its completion the capitalization of the combined business amounted to about $ 3 billion [24] . The takeover has actually completed the consolidation stage in the market for commercial Hadoop distributions (of any notable other market participants, only remained with an annual turnover of about $ 175 million for 2018), shifting the focus of competition to wider segments - big data and analytical tools platforms [25] .

CDH

CDH ( Eng. Cloudera's Distribution including Apache Hadoop ) is an Apache Hadoop distribution, which includes a number of related programs and libraries and Cloudera's own development tools, which are distributed for free and are commercially supported for certain Linux distributions ( Red Hat Enterprise Linux , CentOS , Ubuntu , SuSE SLES , Debian ). Among the Hadoop related Apache software projects, the distribution includes: Flume , HBase , Hive , Mahout , Oozie , Pig , Sqoop , Whirr , Zookeeper . In addition, the distribution kit includes its own cluster management subsystem Cloudera Manager , including Hadoop infrastructure deployment scripts in both local and cloud environments ( Rackspace , Amazon EC2 , ), as well as utilities and configurations to support building automation with Apache Maven .

By the beginning of 2012, two versions of CDH were supplied - CDH2 (based on Hadoop 0.20.1) and CDH3 (based on Hadoop 0.20.2). The CDH3 distribution is included in the delivery of the Oracle Big Data appliance hardware and software complex [26] , while the first customer support line for Hadoop is provided by Oracle , and Cloudera provides technical support for more complex issues. In mid-2012, a version of CDH4 based on Hadoop 2.0 (including the YARN module) was released, and CDH4 also includes three of its own products - (Hadoop cluster management browser interface), Impala and Search (full-text and faceted search in HDFS and HBase environments). In 2014 released version CDH5; The CDH6 version, released in the spring of 2018, is based on Hadoop 3.0 (the key innovation of which was the support for noise-tolerant coding for HDFS, which significantly reduces the physical size of clusters) [27] .

Impala

- a massively parallel mechanism for interactively executing SQL queries on data stored in HDFS and HBase , is distributed under the Apache 2.0 license. Unlike Hive , which translates queries in an SQL-like language (HiveQL) into MapReduce tasks performed in batch mode, Impala performs queries in a distributed environment interactively, distributing the request to processing nodes based on its own mechanism, without resorting to MapReduce.

Cloudera Manager

Cloudera Manager is a specialized component that allows you to automate the creation and modification of Hadoop environments, monitor and analyze the performance of processing tasks, and set up alerts on the occurrence of certain events related to the operation of the distributed processing infrastructure. The annual cost of technical support is about $ 4 thousand per cluster node [28] . For Cloudera Manager, there is a free edition ( English free edition ) that works only on clusters consisting of less than 50 nodes and is devoid of a number of properties available to commercial subscribers (such as performance monitoring, configuration version management, Kerberos support ).

Themed Products

Following the Garnter forecast in the data management technologies HYIP cycle of 2017, suggesting that the concept of “Hadoop distribution” itself is outdated in the near future, the company shifted the emphasis in the product offer to thematic kits made up of virtually the same components that are collected in CDH, but aimed at certain specific tasks. So, in 2018, products appeared under the names Data Warehouse (assembly for data warehouses , with a focus on Impala), Operational DB (for operational databases, around HBase , and Spark ), Data Engineering (for ETL and interactive access to data), Data Science (for “ data science ” tasks), Enterprise Data Hub (for enterprise-level data platforms, it is actually a complete assembly of the Hadoop distribution plus a data catalog based on its own SDX component).

The value policy from 2018 is formed around thematic products; Depending on the configuration, subscribers annually pay from $ 4 thousand for supporting each node of Data Engineering and Data Science products to $ 10 thousand for an node of an Enterprise Data Hub product.

Notes

  1. ↑ Cloudera Inc (Unsolved) . Morning Star (September 7, 2018). Archived September 7, 2018.
  2. ↑ Malik, 2009 , I see some interesting parallels between Red Hat Linux and Red Hat Linux, a version of Linux optimized for corporate users.
  3. ↑ Rao, 2011 , Cloudera helps distribute Red Hat does for the Linux framework.
  4. ↑ Nairn, 2010 , EMC has been teamed up with a “high data” ... Startup Cloudera is using the open source software.
  5. ↑ Vance, 2011 , “It will be guys like Jeff.”
  6. ↑ Prickett-Morgan, 2009 , ... Christophe Bisciglia, who led the partnership between academics to play around; Amr Awadallah, a former Yahoo! - Mike Olson, formerly the chief executive officer of the open source database maker of Sleepycat Software (now owned by Oracle); This is a project that leads you to your homepage. warehouse.
  7. ↑ Businessweek, 2010 , Funding: $ 11 million Gideon Yu and Caterina Fake.
  8. ↑ Prickett-Morgan, 2009 , ... Doug Green (she was the founder and former CEO of VMware) and Marten Mikos.
  9. ↑ Handy, Alex. Hadoop creator goes to Cloudera (English) . SD Times (9 October 2009). The appeal date is December 25, 2011. Archived March 11, 2012.
  10. We Businessweek, 2010 , Cloudera co-founder Christophe Bisciglia was one of Bloomberg BusinessWeek's Best Young Tech Entrepreneurs of 2009.
  11. ↑ Businessweek, 2010 , Toughest decision: Changing Cloudera's business.
  12. ↑ Worthen, Ben . Tide Shifts on Web Start-Ups (Eng.) , N. Y .: The Wall Street Journal (22 November 2011). The appeal date was December 28, 2011. “More than $ 85 million for marketing, has been struck since then, including $ 85 million for marketing, Workday Inc. company Marketo Inc. and $ 40 million for data-management company Cloudera Inc. ”
  13. ↑ 1 2 Darrow, Barb Cloudera snares $ 65M more to boost international, enterprise growth (Eng.) . Gigaom (6 December 2012). - “The funding rounds from the Acy Partners, Ignition Partners, In-Q-Tel, and Meritech Capital Partners.” The date of circulation is December 10, 2012. Archived December 17, 2012.
  14. ↑ Rao, 2011 , Cloudera just announced $ 40 million, New York, Acquired, Meritech Capital Partners, and In-Q-Tel.
  15. Ust Brust, Andrew Cloudera's Impala brings Hadoop to SQL and BI . Big Data darling Cloudera's Impala product raises SQL to peer-level with MapReduce (Eng.) . ZDNet (25 October 2012) . - "It was announced that it would be a" The appeal date is January 1, 2014.
  16. 2 1 2 Prickett Morgan Cloudera taps new CEO for inevitable IPO push or acquisition . Former CEO becomes chairman and chief strategist (English) . The Register (June 20, 2013) . The appeal date is January 1, 2014.
  17. ↑ Clark, Jack Cloudera acquisition: It's a Myrrix (cle) . Elephant snorts baby elephant for English learning . The Register (16 July 2013) . The date of circulation is July 17, 2013. Archived August 31, 2013.
  18. 2 1 2 Harris, Dereck Intel jettisons its Hadoop distro and cloudera (English) . Gigaom (27 March 2014). The appeal date is April 1, 2014.
  19. ↑ Clark, Jack Intel is a $ 740m lighter after Cloudera cash shot . Huge funding deal keeps Oracle, IBM away from upstart's yellow elephant (Eng.) . The Register (31 March 2014) . The appeal date is April 1, 2014.
  20. ↑ Liam Tung. Cloudera buys big data encryption outfit Gazzang . Cloudera buys Gazzang offer hidoped clusters (Unsolved) . ZDNet (June 15, 2014) .
  21. ↑ Anita Balakrishnan. Cloudera shares close more than 20% higher on Day 1 (Unidentified) . CNBC (April 28, 2017).
  22. ↑ Rebecca Hill. Cloudera bags AI biz, eyes up IBM customers ... Someone's noticed the Big Blue's deal with Hortonworks (Neopr.) . The Register (September 8, 2017) .
  23. ↑ Kevin Kelleher. Cloudera, Hortonworks Stocks Soar to Announce a Merger ( Unresolved ) . Fortune (October 3, 2018). The appeal date is October 4, 2018.
  24. ↑ Rebecca Hill. Cloudera, Hortonworks merge into amorphous data-managing blob after stockholder vote . Turns off to PR offensive (Neopr.) . The Register (January 7, 2019) .
  25. ↑ Andrew Brust. Cloudera and Hortonworks' merger closes; quo vadis Big Data? . The two biggest hadoop distribution vendors are now one. What does this mean for the Apache Hadoop? (Neopr.) ZDNet (January 4, 2019) .
  26. ↑ Pricket Morgan, Timothy Oracle mounts Cloudera's elephant for big data ride (English) . The Register (10 January 2012). The date of circulation is January 13, 2012. Archived September 6, 2012.
  27. ↑ Tony Baer. Cloudera Enterprise 6 hits the streets . Hadoop 3.0 takes a starring role in the next release of Cloudera's platform . ZDNet (May 22, 2018) . The appeal date is September 23, 2018.
  28. ↑ Pricket Morgan, Timothy. Cloudera gets proactive with Hadoop management (English) . The Register (September 8, 2011). The appeal date is April 15, 2013. Archived April 18, 2013.

Links

  • Vance, Ashlee . Hadoop, a Free Software Program, Finds Uses Beyond Search (eng.) (HTML), N. Y .: The New York Times (17 March 2009), S. B3. The appeal date is December 13, 2011.
  • Prickett Morgan, Timothy Cloudera floats commercial Hadoop distro (eng.) . The Register (16 March 2009). The appeal date is December 13, 2011. Archived March 11, 2012.
  • Taft, Darryl New Cloudera Desktop GUI Simplifies Hadoop for Users (Eng.) . eWeek (2 October 2009). The appeal date is December 13, 2011. Archived May 17, 2012.
  • Malik, Om Is Hadoop Champion Cloudera the Next Red Hat? (eng.) GigaOm (2 October 2009). The appeal date is December 13, 2011. Archived May 17, 2012.
  • Nairn, Geoff . Big Data, Big Blue and Going Green (English) (HTML), L .: Financial Times (27 September 2010). The appeal date is May 29, 2011.
  • Cloudera's Olson Interview About Data Use (English) . Cloudera's Olson Interview About Data Use . Bloomberg (March 22, 2011). The appeal date is December 13, 2011. Archived May 17, 2012.
  • 7. Cloudera. Entrepreneur: Jeff Hammerbacher, 27 (English) . Best Young Tech Entrepreneurs 2010 . Bloomberg Businessweek (April 20, 2010). The appeal date is December 27, 2011. Archived May 17, 2012.
  • Vance, Ashlee . This Tech Bubble Is Different (English) (HTML), Businessweek , N. Y .: Bloomberg (14 April 2011). The appeal date is May 29, 2011.
  • Jackson, Joab . SGI launches Cloudera Hadoop BI clusters (English) (HTML), Framingham: Computerworld (17 October 2011). The appeal date is May 29, 2011.
  • Rao, Leena Cloudera Updates Hadoop Management App With Health Checks, Reporting Features And More (Eng.) . TechCrunch (December 8, 2011). The appeal date is December 27, 2011. Archived May 17, 2012.
Source - https://ru.wikipedia.org/w/index.php?title=Cloudera&oldid=99024813


More articles:

  • Ramenka (Ksegzhi tributary)
  • Chernov, Artyom Sergeevich
  • Pearl (Tale)
  • New Force (Moldova)
  • Procopius (Petridis)
  • Swimming at the World Aquatics Championship 2011 - 400 meters of integrated swimming (men)
  • Parliamentary Elections in Moldova (1994)
  • And the beech came
  • Palace (Kremenets district)
  • Aulilim

All articles

Clever Geek | 2019