5 min read · Oct 4, 2025

Databricks Deletion Vector

Deletion vectors are a storage optimization feature you can enable on Delta Lake tables. Without deletion vectors, deleting even a single row from a data file means the entire Parquet file containing that record must be rewritten. With deletion vectors enabled for the table, DELETE, UPDATE, and MERGE operations use deletion vectors to mark existing rows as removed or changed without rewriting the Parquet file. Subsequent reads resolve the current table state by applying the deletions recorded in the deletion vectors to the most recent table version.
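
To make the difference concrete, here is a minimal Python sketch of the two strategies. It is a toy model, not how Delta Lake is implemented: the `DataFile` class, the `rewrites` counter, and the function names are all hypothetical, and a plain set of row positions stands in for a real deletion vector.

```python
from dataclasses import dataclass, field


@dataclass
class DataFile:
    """Toy stand-in for an immutable Parquet data file (hypothetical model)."""
    rows: list
    rewrites: int = 0  # number of times the whole file was rewritten
    deleted: set = field(default_factory=set)  # toy deletion vector: row positions


def delete_with_rewrite(f: DataFile, predicate) -> None:
    """Without deletion vectors: rewrite the whole file minus the matching rows."""
    if any(predicate(row) for row in f.rows):
        f.rows = [row for row in f.rows if not predicate(row)]
        f.rewrites += 1


def delete_with_dv(f: DataFile, predicate) -> None:
    """With deletion vectors: record matching positions, leave the file untouched."""
    for pos, row in enumerate(f.rows):
        if predicate(row):
            f.deleted.add(pos)


def read(f: DataFile) -> list:
    """Readers resolve current state by skipping positions marked as deleted."""
    return [row for pos, row in enumerate(f.rows) if pos not in f.deleted]


# Same logical delete, two physical strategies.
rewritten = DataFile(rows=[{"id": 1}, {"id": 2}, {"id": 3}])
delete_with_rewrite(rewritten, lambda r: r["id"] == 2)

marked = DataFile(rows=[{"id": 1}, {"id": 2}, {"id": 3}])
delete_with_dv(marked, lambda r: r["id"] == 2)

print(read(rewritten), "rewrites:", rewritten.rewrites)
print(read(marked), "rewrites:", marked.rewrites, "physical rows:", len(marked.rows))
```

Both reads return the same logical rows. In the second case the file is never rewritten and still physically holds all three rows, which is the trade-off deletion vectors make: cheaper writes in exchange for a small filtering step at read time.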

Databricks recommends Databricks Runtime 14.3 LTS or above for writing to tables with deletion vectors, so that all optimizations are available. Tables with deletion vectors enabled can be read in Databricks Runtime 12.2 LTS and above.

Deletion vectors are enabled by default when you create a new table using a SQL warehouse or Databricks Runtime 14.1 or above.

Note:

Deletion vectors are not enabled by default for materialized views and streaming tables stored in Hive metastore.

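To enable deletion vectors explicitly, set the delta.enableDeletionVectors table property, either when creating a table or on an existing one:

CREATE TABLE <table-name> [options] TBLPROPERTIES ('delta.enableDeletionVectors' = true);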

ALTER TABLE <table-name> SET TBLPROPERTIES ('delta.enableDeletionVectors' = true);

Have you ever thought about how deletion operates under the hood in Databricks? When a record is…

Nidhi Gupta

Written by Nidhi Gupta

Azure Data Engineer 👨‍💻. Heading towards cloud technologies expertise✌️.
