add a compression analyzer facility #4715

RaduBerinde · 2025-05-12T20:48:47Z

This issue tracks adding a facility for analyzing data in real clusters. The goal is to get a good comparison between various compression algorithms and levels and use it to inform our suggested defaults or to add new adaptive compression algorithms.

We have two ways of doing this:

online: we can sample blocks as they are written to or read from disk. For each sampled blocks, we run all experiments and retain statistics. This approach has the advantage of allowing us to accurately estimate CPU usage differences between algorithms within a specific workload. The disadvantage is that we can only produce data on clusters with versions that include this facility.
"offline": we can add a CLI tool that looks at all relevant files from a store and samples blocks separately from any running process. This is easier to implement and provides a quicker way to obtain data, as a newer binary can be used just for this tool.

Jira issue: PEBBLE-442

Epic CRDB-49140

RaduBerinde · 2025-05-12T20:49:34Z

The plan is to start with the "offline" variant and re-evaluate whether we also want the "online" variant later.

RaduBerinde self-assigned this May 12, 2025

blathers-crl bot added A-storage T-storage labels May 12, 2025

RaduBerinde mentioned this issue May 12, 2025

tool: add db analyze-data command #4710

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add a compression analyzer facility #4715

add a compression analyzer facility #4715

RaduBerinde commented May 12, 2025 •

edited by exalate-issue-sync bot

Loading

RaduBerinde commented May 12, 2025

Uh oh!

add a compression analyzer facility #4715

add a compression analyzer facility #4715

Comments

RaduBerinde commented May 12, 2025 • edited by exalate-issue-sync bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

RaduBerinde commented May 12, 2025

Uh oh!

RaduBerinde commented May 12, 2025 •

edited by exalate-issue-sync bot

Loading