Reading note: High Performance MySQL - Storage Engine

May 19 2018

Storage Engine

MySQL storage engine is a separate layer from MySQL server.
Each table can use different storage engine.
However, MySQL server manages table definition, and its name is dependent on platform: Windows allows case-insensitive name, and Linux is case-sensitive.

We’ll look through a few popular engines:

One of the oldest storage engine of MySQL
Has separate data and index file. The max size is the max file size allowed by OS.
Supports advance index features: GIS, full-text index
Is non-transactional
Has auto/manual repair
Has fast insert/read queries; however, modify queries lock entire table, blocks all read queries.
Has delay key-write to improve performance when writing index
Has compress mode for CD-ROM and DVD-ROM: has small size and fast read due to low disk I/O

One of the most popular engines
Is built for short-live transactions
Is transactional, and implements 4 MySQL isolation levels.
Uses MVCC for high concurrency
Default isolation level is Repeatable Read, which uses next-key locking to prevent phantom read.
Has multiple data files, which separate table data and indexes.
Has clustered indexes
Fast primary key lookup
Secondary indexes also has primary key, so indexes for table with a lot of records can be very big and slow.
Has slower table rebuild time compared to MyISAM
Implements Foreign key constraint, which MySQL does not implement.

Supports only INSERT and SELECT queries
Has much less disk I/O than MyISAM because it buffers data writes and compresses row with zlib when inserting
Each SELECT query requires full table scan
Is usually used for logging and data acquisition
Supports row-level locking and special buffer system for high-concurrency insert
Is non-transactional

Is designed for high performance with redundancy and load-balancing capabilities
Keeps all data in memory and is optimized for primary key lookups
Has share-nothing infrastructure, consists of data nodes, management nodes and SQL nodes (MySQL instances)
Each data nodes is a shard of the cluster’s data, which is replicated to multiple nodes
Performs joins at MySQL server level, and can be extremely slow due to network latency. On another hand, single-table lookups can be very fast.
Is very large and complex
Is usually unsuitable for traditional applications

Uses transaction logs and data files to avoid write-ahead logging, which reduces overhead
Uses MVCC
Supports foreign key constraint
Does not use clustered index
Has BLOB streaming: when combining with BLOB engine, it can stream binary and media file directly in and out of the database

Now is called Aria in MariaDB
Replaces MyISAM as default engine of MariaDB
Uses Aria table format instead of MyISAM, which speeds up some GROUP BY and DISTINCT queries

You should take into account a few factors: