Avoid slowdown in get_flags_str by arunkannawadi · Pull Request #258 · esheldon/ngmix

arunkannawadi · 2026-06-01T16:24:50Z

In #248 , we introduced safe casting to uint32, but it seemed to have significantly slowdown the bitwise AND operation that follows. In DP2 production, the calls to get_flags_str are taking a significant amount of time, and this is a low-hanging fruit to fix. Along with it is another minor optimization calculating powers of 2 cumulatively. Together, this brings down the per-call time from ~50 μs to ~8 μs, which is a signficant gain given how many times this functions gets called.

since the current `rubin-env` uses Python 3.13.

and not an array.

esheldon · 2026-06-01T16:46:18Z

Are there any additional tests we should add to make sure we are comfortable?

arunkannawadi · 2026-06-01T17:07:12Z

I had added a test against various integer types and lengths. As long as any of the implementations don't break that test, I think we can be comfortable. Please expand that test if you'd like and happy to go with any implementation of casting as long as the per-call time is under 10 microseconds. If you are willing to drop the support for val to be a 8-bit or 16-bit (unsigned) ints, a casting is not even necessary.

arunkannawadi · 2026-06-01T17:07:20Z

https://github.com/esheldon/ngmix/blob/master/ngmix/tests/test_flags.py

beckermr · 2026-06-01T17:32:37Z

I'm actually confused as to how this PR passes. I am going to try some stuff locally.

arunkannawadi · 2026-06-01T17:36:59Z

FWIW, the CI uses numpy>=2, so the breakage you saw locally with numpy v1 wouldn't occur. Since you want to keep the support for numpy 1, I believe we should drop the last commit and go with the array based thing that this PR originally had. But please try this out locally and make any changes you'd like; you should have edit access to directly add commits here.

beckermr · 2026-06-01T19:03:04Z

OK. Our previous test missed an important case of python ints. I have now rearranged things to explicitly truncate the extra bits.

beckermr · 2026-06-01T19:04:41Z

We may want to emit a warning in this case or simply have the code error instead of truncating.

What do you think @esheldon?

esheldon

I'd also be fine with just crashing if it is out of range.

beckermr · 2026-06-01T19:41:09Z

@arunkannawadi can you time this to see if it is still faster?

beckermr · 2026-06-01T19:45:17Z

My machine is being odd on timing and I see no difference amongst the various options.

arunkannawadi · 2026-06-01T19:48:37Z

This is slow (50 microseconds). A good rule-of-thumb is after casting val must not be of type array.

beckermr · 2026-06-01T20:00:12Z

Thanks. Try the latest commit.

arunkannawadi · 2026-06-01T20:02:51Z

This is good. 6 microseconds or so.

arunkannawadi · 2026-06-09T16:25:25Z

I believe there's going to be a new rubinenv coming soon. if you can cut a release with this and other changes, we can get them in there.

arunkannawadi and others added 4 commits June 1, 2026 11:52

CI Add py3.13 to the test matrix

e4f197d

since the current `rubin-env` uses Python 3.13.

PERF Calculate powers of 2 efficiently

da9cf0b

PERF Cast if-needed to np.uint32

a225920

and not an array.

DOC Add to CHANGES.md

b9435cf

beckermr reviewed Jun 1, 2026

View reviewed changes

Comment thread ngmix/flags.py

PERF Avoid arrays altogether

52964d6

arunkannawadi force-pushed the flagstr branch from 745cb94 to 52964d6 Compare June 1, 2026 16:42

fix: properly truncate top 32 bits

d021e67

esheldon reviewed Jun 1, 2026

View reviewed changes

Comment thread ngmix/flags.py

beckermr added 3 commits June 1, 2026 14:24

fix: weird edge case for py39

ad8ef61

doc: comment on bitmask

3d1989b

doc: update doc string

aeaaaaa

arunkannawadi commented Jun 1, 2026

View reviewed changes

Comment thread ngmix/flags.py Outdated

beckermr added 2 commits June 1, 2026 14:33

fix: wrong thing

4f0a5a5

perf: use array scalar

fd685a2

arunkannawadi commented Jun 1, 2026

View reviewed changes

Comment thread ngmix/flags.py Outdated

fix: avoid the if

f6f7cab

put it all back

7f68c7f

beckermr reviewed Jun 1, 2026

View reviewed changes

Comment thread ngmix/flags.py Outdated

Apply suggestion from @beckermr

deab2a3

beckermr approved these changes Jun 1, 2026

View reviewed changes

arunkannawadi commented Jun 1, 2026

View reviewed changes

Comment thread ngmix/flags.py

test: ensure we use numpy 1 on py39

76880a3

beckermr merged commit 9a4d396 into esheldon:master Jun 5, 2026
6 checks passed

Conversation

arunkannawadi commented Jun 1, 2026

Uh oh!

Uh oh!

esheldon commented Jun 1, 2026

Uh oh!

arunkannawadi commented Jun 1, 2026

Uh oh!

arunkannawadi commented Jun 1, 2026

Uh oh!

beckermr commented Jun 1, 2026

Uh oh!

arunkannawadi commented Jun 1, 2026

Uh oh!

beckermr commented Jun 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

beckermr commented Jun 1, 2026

Uh oh!

esheldon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

beckermr commented Jun 1, 2026

Uh oh!

beckermr commented Jun 1, 2026

Uh oh!

arunkannawadi commented Jun 1, 2026

Uh oh!

Uh oh!

beckermr commented Jun 1, 2026

Uh oh!

arunkannawadi commented Jun 1, 2026

Uh oh!

Uh oh!

Uh oh!

arunkannawadi commented Jun 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

beckermr commented Jun 1, 2026 •

edited

Loading