The anatomy of Hibernate dirty checking mechanism

Last modified:

Are you struggling with performance issues in your Spring, Jakarta EE, or Java EE application?

Imagine having a tool that could automatically detect performance issues in your JPA and Hibernate data access layer long before pushing a problematic change into production!

With the widespread adoption of AI agents generating code in a heartbeat, having such a tool that can watch your back and prevent performance issues during development, long before they affect production systems, can save your company a lot of money and make you a hero!

Hypersistence Optimizer is that tool, and it works with Spring Boot, Spring Framework, Jakarta EE, Java EE, Quarkus, Micronaut, or Play Framework.

So, rather than allowing performance issues to annoy your customers, you are better off preventing those issues using Hypersistence Optimizer and enjoying spending your time on the things that you love!

Introduction

The persistence context enqueues entity state transitions that get translated to database statements upon flushing. For managed entities, Hibernate can auto-detect incoming changes and schedule SQL UPDATES on our behalf. This mechanism is called automatic dirty checking.

The default dirty checking strategy

By default Hibernate checks all managed entity properties. Every time an entity is loaded, Hibernate makes an additional copy of all entity property values. At flush time, every managed entity property is matched against the loading-time snapshot value:

So the number of individual dirty checks is given by the following formula:

$N = \sum\limits_{k=1}^n p_{k}$

where

n = The number of managed entities
p = The number of properties of a given entity

Even if only one property of a single entity has ever changed, Hibernate will still check all managed entities. For a large number of managed entities, the default dirty checking mechanism may have a significant CPU and memory footprint. Since the initial entity snapshot is held separately, the persistence context requires twice as much memory as all managed entities would normally occupy.

Bytecode instrumentation

A more efficient approach would be to mark dirty properties upon value changing. Analogue to the original deep comparison strategy, it’s good practice to decouple the domain model structures from the change detection logic. The automatic entity change detection mechanism is a cross-cutting concern, that can be woven either at build-time or at runtime.

The entity class can be appended with bytecode level instructions implementing the automatic dirty checking mechanism.

Weaving types

The bytecode enhancement can happen at:

Build-time

After the hibernate entities are compiled, the build tool (e.g. ANT, Maven) will insert bytecode level instructions into each compiled entity class. Because the classes are enhanced at build-time, this process exhibits no extra runtime penalty. Testing can be done against enhanced class versions, so that the actual production code is validated before the project gets built.
Runtime

The runtime weaving can be done using:
- A Java agent, doing bytecode enhancement upon entity class loading
- A runtime container (e.g. Spring), using JDK Instrumentation support

If you enjoyed this article, I bet you are going to love my Book and Video Courses as well.

Hibernate 5 improvements

Hibernate 3 has been offering bytecode instrumentation through an ANT target but it never became mainstream and most Hibernate projects are still currently using the default deep comparison approach.
Hibernate 5 has redesigned the bytecode enhancement mechanism, is more reliable than it used to be.

Follow @vlad_mihalcea

High-Performance Java Persistence rocks!

Category: Hibernate Tags: automatic dirty checking, hibernate, Session flush, Training, Tutorial

Vlad Mihalcea

The anatomy of Hibernate dirty checking mechanism

Introduction

The default dirty checking strategy

Bytecode instrumentation

Weaving types

Hibernate 5 improvements

Related

Leave a Reply Cancel reply

Let’s connect

Find Article

Become a Java Champion

Riveran

Book

Video Courses

Sponsored

Training

Hypersistence Optimizer

Tutorials

Social Media

About

Meta

Vlad Mihalcea

The anatomy of Hibernate dirty checking mechanism

Introduction

The default dirty checking strategy

Bytecode instrumentation

Weaving types

Hibernate 5 improvements

Thank you!

Related

Leave a Reply Cancel reply

Let’s connect

Find Article

Become a Java Champion

Riveran

Book

Video Courses

Sponsored

Training

Hypersistence Optimizer

Tutorials

Social Media

About

Meta