feat: add get_fcidump to output by haneug · Pull Request #253 · faccts/opi

haneug · 2026-06-19T06:52:09Z

Closes Issues

Closes None

Description

Added a small helper class that parses the FCIDUMP file and returns the important quantities (one- and two-electron integrals) as NumPy arrays. This object is populated and returned by output.get_fcidump.

Release Notes

Added

Added get_fcidump for easy access to fcidump file properties.

timmyte · 2026-06-25T15:02:12Z

+        """
+        Parse the fcidump file generated by ORCA and return its data in the `Fcidump` data class.
+        The fcidump file has to be generated by the ORCA job and cannot be generated on-the-fly after the calculation.
+        ORCA can generate fcidump files via the `dumpactints true` flag in the `%output` block.


Suggested change

ORCA can generate fcidump files via the `dumpactints true` flag in the `%output` block.

To generate FCIDUMP files, set the following options:

```
%output
dumpactints true
end
```

IMO this reads more concisely.

timmyte · 2026-06-25T15:02:27Z

+
+    def get_fcidump(self) -> Fcidump | None:
+        """
+        Parse the fcidump file generated by ORCA and return its data in the `Fcidump` data class.


Suggested change

Parse the fcidump file generated by ORCA and return its data in the `Fcidump` data class.

Parse the FCIDUMP file generated by ORCA and return its data in the `Fcidump` data class.

timmyte · 2026-06-25T15:02:37Z

+    def get_fcidump(self) -> Fcidump | None:
+        """
+        Parse the fcidump file generated by ORCA and return its data in the `Fcidump` data class.
+        The fcidump file has to be generated by the ORCA job and cannot be generated on-the-fly after the calculation.


Suggested change

The fcidump file has to be generated by the ORCA job and cannot be generated on-the-fly after the calculation.

The FCIDUMP file has to be generated by the ORCA job and cannot be generated on-the-fly after the calculation.

timmyte · 2026-06-25T15:03:33Z

+        Returns
+        -------
+        fcidump_data: Fcidump | None
+            The parsed fcidump data or None if the file is not present or could not be parsed.


Suggested change

The parsed fcidump data or None if the file is not present or could not be parsed.

The parsed FCIDUMP data or None if the file is not present or could not be parsed.

timmyte · 2026-06-25T15:05:38Z

@@ -0,0 +1,161 @@
+"""Parse a potential FCIDUMP file"""


A module that contains one primary class should also be named after that class -> fcidump.py

timmyte · 2026-06-25T15:21:13Z

+        return tensor
+
+    @classmethod
+    def parse_fcidump(cls, path: Path | str) -> "Fcidump":


IMO from_file() is a more accessible name than parse_fcidump().

Are the FCIDUMP files documented anywhere? Could add the link to the docstring, if so.

timmyte · 2026-06-25T15:22:12Z

+    @classmethod
+    def _get_int(cls, key: str, header: str) -> int:
+        """Return the integer value of the given key."""
+        m = re.search(rf"{key}\s*=\s*(\d+)", header, re.IGNORECASE)


What about negative numbers?
If these lines always look as follows:

key = 1234 [END OF LINE]

You could use .partition("=")
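A minimal sketch of the `.partition("=")` idea, assuming each header entry really sits on its own line as `key = 1234`. The helper name `parse_int_line` is hypothetical and not part of this PR; stripping a trailing comma is also an assumption about how the line may end.

```python
def parse_int_line(line: str, key: str) -> int:
    """Parse a single header line of the form ``key = 1234``.

    Hypothetical helper for illustration, not the PR's ``_get_int``.
    """
    name, sep, value = line.partition("=")
    # partition() returns an empty separator if "=" is not in the line.
    if not sep or name.strip().lower() != key.lower():
        raise ValueError(f"expected '{key} = <int>', got {line!r}")
    # Assumption: the value may be followed by a trailing comma.
    # int() itself accepts surrounding whitespace and a leading +/- sign.
    return int(value.strip().rstrip(","))


print(parse_int_line("NORB = 6", "NORB"))
print(parse_int_line("ms2=-2,", "MS2"))
```

Because `int()` handles the sign, negative values need no extra handling here. This only works if a line never holds more than one `key = value` pair.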

timmyte · 2026-06-25T15:27:24Z

+    @classmethod
+    def _get_int_list(cls, key: str, header: str) -> list[int]:
+        """Return a list of integers corresponding to the given key."""
+        m = re.search(rf"{key}\s*=\s*([\d,\s]+)", header, re.IGNORECASE)


Suggested change

m = re.search(rf"{key}\s*=\s*([\d,\s]+)", header, re.IGNORECASE)

m = re.search(rf"{key}\s*=\s*(\d+(\s*,\s*\d+)+)", header, re.IGNORECASE)

What about a more precise regex?
This still does not account for a leading plus/minus sign.
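One way the sign could be covered, as a sketch only: the function below is a standalone stand-in for `_get_int_list()`, not the PR's code. It allows an optional `+`/`-` in front of every entry and, unlike the suggested pattern, also accepts a list with a single entry (`*` instead of `+` on the repeated group).

```python
import re

# Optional sign followed by digits.
_SIGNED_INT = r"[+-]?\d+"


def get_int_list(key: str, header: str) -> list[int]:
    """Return the comma-separated integers that follow ``key =``.

    Hypothetical stand-in for ``Fcidump._get_int_list``.
    """
    pattern = rf"{re.escape(key)}\s*=\s*({_SIGNED_INT}(?:\s*,\s*{_SIGNED_INT})*)"
    m = re.search(pattern, header, re.IGNORECASE)
    if m is None:
        raise ValueError(f"key {key!r} not found in header")
    # int() tolerates the whitespace (including newlines) left around each token.
    return [int(token) for token in m.group(1).split(",")]


header = "&FCI NORB=3,NELEC=2,MS2=0,\n ORBSYM=1, 1,\n 2,\n ISYM=1,\n&END"
print(get_int_list("ORBSYM", header))
```

The match stops at the last integer, so a trailing comma followed by the next key (here `ISYM`) is not swallowed into the list.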

timmyte · 2026-06-25T15:29:27Z

@@ -0,0 +1,79 @@
+import textwrap


Please add unit tests for _get_int_list() and _get_int() specifically.

timmyte · 2026-06-25T15:33:56Z

+
+@pytest.mark.unit
+def test_parse_fcidump_header(tmp_path: Path) -> None:
+    fcidump_text = textwrap.dedent("""\


I would actually not remove the indentation.
If a file format does not rely on indentation, then our file parser shouldn't either. The current implementation doesn't (good implementation 👍).
So I would actively test that.
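A sketch of what such a test could look like. `get_int` below is a hypothetical stand-in for `Fcidump._get_int()`, and the header text is made-up sample data in the usual FCIDUMP namelist layout. In the real test suite this would carry the `@pytest.mark.unit` marker and could be parametrized over the prefixes.

```python
import re
import textwrap


def get_int(key: str, header: str) -> int:
    """Hypothetical stand-in for ``Fcidump._get_int`` (signed integers allowed)."""
    m = re.search(rf"\b{re.escape(key)}\s*=\s*([+-]?\d+)", header, re.IGNORECASE)
    if m is None:
        raise ValueError(f"key {key!r} not found in header")
    return int(m.group(1))


# Made-up sample header, used only for this illustration.
FCIDUMP_HEADER = """\
&FCI NORB=2,NELEC=2,MS2=0,
 ORBSYM=1,1,
 ISYM=1,
&END
"""


def test_header_parsing_ignores_indentation() -> None:
    # Parse the same header with no indentation, spaces, and a tab.
    for prefix in ("", "    ", "\t"):
        text = textwrap.indent(FCIDUMP_HEADER, prefix)
        assert get_int("NORB", text) == 2
        assert get_int("NELEC", text) == 2
        assert get_int("MS2", text) == 0


test_header_parsing_ignores_indentation()
```

Testing several prefixes in one place documents that indentation is deliberately irrelevant to the parser, instead of hiding that behind `textwrap.dedent`.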

haneug added this to the 3.0.0 milestone Jun 19, 2026

haneug self-assigned this Jun 19, 2026

haneug added the enhancement (New feature or request) and side output (Concerning parsing ORCA output) labels Jun 19, 2026

haneug marked this pull request as ready for review June 19, 2026 06:53

haneug requested a review from a team as a code owner June 19, 2026 06:53

feat: add get_fcidump to output

17a2832

haneug force-pushed the feature/output-fcidump-parser branch from a2b39ef to 17a2832 Compare June 19, 2026 07:38

haneug added 3 commits June 19, 2026 14:47

doc: improve get_fcidump docstring

ea5ce74

doc: fix typo

b050e0d

feat: Small improvements to Fcidump

97f24c8

timmyte requested changes Jun 25, 2026

haneug wants to merge 4 commits into faccts:main from haneug:feature/output-fcidump-parser


2 participants
