Discuss Apache Presto Apache Presto is an open source distributed SQL engine. Presto originated at Facebook for data analytics needs and was later open sourced. Teradata has since joined the Presto community and offers support. Apache Presto is very useful for running queries over even petabytes of data. Extensible architecture and storage plugin interfaces are very easy to […]
Category Archives: Apache Presto
Apache Presto – HIVE Connector The Hive connector allows querying data stored in a Hive data warehouse. Prerequisites: Hadoop, Hive. This assumes Hadoop and Hive are already installed on your machine. Start all the services one by one in a new terminal. Then start the Hive metastore using the following command:
$ hive --service metastore
Presto uses Hive […]
Apache Presto – KAFKA Connector The Kafka connector for Presto allows access to data from Apache Kafka using Presto. Prerequisites: download and install the latest version of the following Apache projects: Apache ZooKeeper and Apache Kafka. Start ZooKeeper: start the ZooKeeper server using the following command.
$ bin/zookeeper-server-start.sh config/zookeeper.properties
ZooKeeper now starts on port 2181. Start Kafka […]