Does Presto cache data?

Presto holds intermediate data in its in-memory buffers for the lifetime of the tasks that use it. However, it is not meant to serve as a caching solution or a persistent storage layer.

Besides, is Presto a database?

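Presto is a distributed SQL query engine rather than a database: it does not store data itself. It is often deployed alongside Hadoop to query data in HDFS, but it does not require Hadoop, and its architecture is similar to that of a classic massively parallel processing (MPP) database management system. To make Presto extensible to any data source, it was designed with a storage abstraction that makes it easy to build pluggable connectors.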
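The idea of a storage abstraction with pluggable connectors can be illustrated with a small sketch. This is not Presto's actual connector interface (Presto's is written in Java and is considerably richer); the names `Connector`, `InMemoryConnector`, `CsvTextConnector`, and `count_rows` are hypothetical and exist only for this example. It shows one piece of "engine" code working unchanged against two different data sources.

```python
from typing import Dict, Iterator, List, Protocol, Tuple

Row = Tuple[object, ...]


class Connector(Protocol):
    """Hypothetical storage abstraction: the engine only sees these methods."""

    def list_tables(self) -> List[str]: ...

    def scan(self, table: str) -> Iterator[Row]: ...


class InMemoryConnector:
    """Serves rows from Python lists held in memory."""

    def __init__(self, tables: Dict[str, List[Row]]) -> None:
        self._tables = tables

    def list_tables(self) -> List[str]:
        return sorted(self._tables)

    def scan(self, table: str) -> Iterator[Row]:
        return iter(self._tables[table])


class CsvTextConnector:
    """Serves rows parsed from comma-separated text, one table per string."""

    def __init__(self, files: Dict[str, str]) -> None:
        self._files = files

    def list_tables(self) -> List[str]:
        return sorted(self._files)

    def scan(self, table: str) -> Iterator[Row]:
        for line in self._files[table].splitlines():
            yield tuple(line.split(","))


def count_rows(connector: Connector, table: str) -> int:
    """Engine-side logic: knows nothing about where the rows come from."""
    return sum(1 for _ in connector.scan(table))
```

Adding a new data source in this toy model means writing one more class with the same two methods; `count_rows` does not change.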

Likewise, does Presto use MapReduce?

Presto is an open-source parallel SQL execution engine. Unlike Hive, Presto doesn't use the MapReduce framework for its execution. Instead, Presto accesses the data directly through a specialized distributed query engine that is very similar to those found in commercial parallel RDBMSs.

In this regard, why is Presto fast?

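Presto pipelines intermediate data between stages in memory. MapReduce operates on a "pull" model: each task writes its output to disk, and the following tasks pull that data from the preceding tasks once they have finished. In Presto, each stage streams its output directly to the stage that consumes it, so the intermediate data never has to be written out and read back, which makes the query significantly faster.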
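The difference between passing rows along as they are produced and finishing a whole stage before the next one starts can be seen in a toy model. The sketch below is an illustration only, not how Presto or MapReduce is implemented: the function names are hypothetical, a Python list stands in for data written to disk, and Python generators are themselves pull-based, so the example demonstrates the pipelining aspect rather than the push mechanism.

```python
from typing import Iterable, Iterator, List, Tuple


def scan(rows: Iterable[int], log: List[str]) -> Iterator[int]:
    """First stage: read rows one at a time."""
    for row in rows:
        log.append(f"scan {row}")
        yield row


def keep_even(rows: Iterable[int], log: List[str]) -> Iterator[int]:
    """Second stage: keep only even values."""
    for row in rows:
        if row % 2 == 0:
            log.append(f"filter {row}")
            yield row


def run_pipelined(rows: Iterable[int]) -> Tuple[int, List[str]]:
    """Stages are chained, so each row flows through both stages at once."""
    log: List[str] = []
    total = sum(keep_even(scan(rows, log), log))
    return total, log


def run_staged(rows: Iterable[int]) -> Tuple[int, List[str]]:
    """Each stage finishes and materializes its output before the next starts."""
    log: List[str] = []
    scanned = list(scan(rows, log))  # stand-in for writing stage output to disk
    filtered = list(keep_even(scanned, log))
    return sum(filtered), log
```

Both functions return the same total, but the logs differ: in `run_pipelined` the filter entries are interleaved with the scan entries, while in `run_staged` every scan entry appears before the first filter entry.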

How does Presto work?

Presto is a distributed system that runs on a cluster of nodes. Its distributed query engine is optimized for interactive analysis and supports standard ANSI SQL, including complex queries, aggregations, joins, and window functions. Presto's architecture is simple and extensible.

