Facebook open sources its SQL-on-Hadoop engine, and the web rejoices
Mais uma tecnologia desenvolvida dentro de empresas que não estão na briga com empresas como Cloudera, Hortonworks, etc. Interessante ver como grande parte da inovação tem vindo de empresas como Facebook, Google, Linkedin, etc.
Facebook has open sourced Presto, the interactive SQL-on-Hadoop engine the company first discussed in June. Presto is Facebook’s take on Cloudera’s Impala or Google’s Dremel, and it already has some big-name fans in Dropbox and Airbnb.
Technologically, Presto and other query engines of its ilk can be viewed as faster versions of Hive, the data warehouse framework for Hadoop that Facebook created several years ago. Facebook and many other Hadoop users still rely heavily on Hive for batch-processing jobs such as regular reporting, but there has been a demand for something letting users perform ad hoc, exploratory queries on Hadoop data similar to how they might do them using a massively parallel relational database.
Presto is 10 times faster than Hive for most queries, according to Facebook software engineer Martin Traverso in a blog post detailing today’s news.
Technologically, Hive and Presto are very different, namely because the former…
Ver o post original 399 mais palavras