One sized cache by esheldon · Pull Request #495 · esheldon/fitsio

esheldon · 2026-05-11T17:30:01Z

Limit cacheing of compressed data reads to a single tile.

After changing cache, make sure reads at size of tile still working

esheldon · 2026-05-11T18:23:47Z

OK, we'll still wait for Eli

erykoff · 2026-05-11T22:25:00Z

Only a couple attempts to do the timing tests here, but I think this is generally accurate.

Reading the large file in healsparse that triggered this bug on a fast local ssd I get:

default, using fitsio it takes 46s with a peak of 12Gb of memory down to 160Mb
astropy fits instead of fitsio, it takes 67s with a peak of 1Gb of memory down to 160Mb
Patch healsparse to close/open the fits file to clear the cache: 53s, with a peak of 231Mb
The patch on this PR takes 49s with a peak of 206Mb.

So this patch does fix the problem with a very slight performance penalty. It is faster than repeatedly closing and opening the file. And it has the lowest overall memory usage.

beckermr · 2026-05-11T22:41:15Z

Can we increase the cache size to a few? Maybe 3?

erykoff · 2026-05-11T22:42:38Z

I have no idea why this should be slower though, since I'm reading sequentially one compressed block at a time.

beckermr · 2026-05-11T23:23:19Z

49 vs 46 is not a big difference. I'm mostly concerned for applications with more randomized access patterns. We should use a bit more memory to speed computation.

esheldon · 2026-05-15T13:50:07Z

This version uses cache size 3 tiles

esheldon · 2026-05-15T13:59:44Z

In my tests on Eli's file the new version is definitely slower.

The main difference with the new one is it is constantly freeing memory.

esheldon · 2026-05-15T14:05:45Z

OK, this seems to be slowness in recent unreleased fitsio, not this particular PR.

I tried the recent version with updated cfitsio, before this PR, and it is also slower

esheldon · 2026-05-15T14:11:25Z

So I tried v1.3.0 compiled on my machine vs. v1.3.0 installed from conda, and the conda version is faster.

so my own slower tests may just be an issue with compilers on my system

beckermr · 2026-05-15T18:58:22Z

 +      // tilecol = (row - 1) % ((long)(((outfptr->Fptr)->znaxis[0] - 1) / ((outfptr->Fptr)->tilesize[0])) + 1);
 +      // Cache only a single tile
-+      tilecol = 0;
+      tilecol = (row - 1) % NTILEBINS;


Just so I make sure I understand, if I read two tiles spaced by NTILEBINS in the order t1 -> t1 + NTILEBINS -> t1, then I will have a cache miss on the second read of t1 even though for NTILEBINS >=2, there is enough room in the cache that no cache miss should be needed?

Is there a simple, but more complex hashing function we could use here to help randomize this a bit more to avoid this common case?

This SO post has some simple ones. It appears the SDBM one might be a good choice.

The implementation for SDBM is

uint32_t SDBM_hash(const uint8_t* buf, size_t size) { uint32_t hash = 0; for (size_t i = 0; i < size; i++) hash = (hash << 6) + (hash << 16) - hash + buf[i]; /* hash * 65599 + byte */ return hash; }

That code snippet is from https://stackoverflow.com/a/77342581 and is fine to use with proper attribution.

I agree this cache mechanism is not good. I considered implementing something better but I thought we might be getting in the the territory of "patch files are not a good way to keep track of these changes". Especially since the chance of upstream accepting the patch is probably zero.

That said, I'd be happy to do it if we think keeping the patches alive is worth it. But in that case I don't think the current patch management is going to be sufficient because it only supports a single patch file. Yes, we can combine patches but at that point we might as well let git manage the cfitsio files.

only supports a single patch file per original file

esheldon added 4 commits May 11, 2026 13:11

patches to force caching single compressed tile

5f164eb

Update CHANGES.md

665bd0d

add check chunk reading still works

957c1bc

After changing cache, make sure reads at size of tile still working

satisfy the dog

f7e2e6e

This was referenced May 11, 2026

Limit memory usage of cached reads from compressed images #492

Open

Add second patch to imcompress.c limiting cache to one tile #493

Open

beckermr approved these changes May 11, 2026

View reviewed changes

esheldon added 3 commits May 15, 2026 09:24

add patches/fitsio.h.patch

6387963

Add NTILEBINS which controls number of cached tiles

78b320a

clean up patch, modify CHANGES.md

ecc1643

beckermr reviewed May 15, 2026

View reviewed changes

beckermr added 3 commits June 4, 2026 16:19

Merge branch 'master' into one-sized-cache

f0f1be3

Update imcompress.c patch for compression logic

5c8164c

Refactor compression return logic in imcompress.c

c13f2cf

Conversation

esheldon commented May 11, 2026

Uh oh!

esheldon commented May 11, 2026

Uh oh!

erykoff commented May 11, 2026

Uh oh!

beckermr commented May 11, 2026

Uh oh!

erykoff commented May 11, 2026

Uh oh!

beckermr commented May 11, 2026

Uh oh!

esheldon commented May 15, 2026

Uh oh!

esheldon commented May 15, 2026

Uh oh!

esheldon commented May 15, 2026

Uh oh!

esheldon commented May 15, 2026

Uh oh!

beckermr May 15, 2026

Choose a reason for hiding this comment

Uh oh!

beckermr May 15, 2026

Choose a reason for hiding this comment

Uh oh!

beckermr May 15, 2026

Choose a reason for hiding this comment

Uh oh!

esheldon May 18, 2026

Choose a reason for hiding this comment

Uh oh!

esheldon May 18, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants