feat: Implement icebug-disk format#429
Conversation
7f26fa3 to
2c9f811
Compare
|
How's this different from: |
updating description. Give me a minute |
|
Improvements:
However, the main motivation behind adding new classes is having separate entities for icebug-format because it may diverge from existing columnar implementations. As discussed earlier, we are planning to use/replace exisiting parquet or arrow classes for normal tables or delegate to duckDB anyway |
|
With this implementation, we can have a working db as an output from non-icedisk-to-icedisk-converter or user can pass the The main issue is path though. If the user moves the db or path, he/she might need to trigger Couple of clarifications:
|
|
The improvements are nice. But we can't have two implementations with 2x the bugs. Need to remove one and justify the other with benchmarks/data comparing the improvement. DuckDB and Lance could be additional ColumnarBaseTable implementations. They don't remove the need for parquet. |
It already creates a metadata table thusly: Yes, this is a good place to add
We can specify that The distinction between schema.cypher and catalog entry are mechanical. I don't think we should spend a lot of time specifying that or we'll be duplicating a significant chunk of docs.ladybugdb.com. However, getting other Graph databases and analytical packages to adopt icebug is an explicit goal. So I'd try to strike a balance between over-specifying (as opposed to just refer to reference implementation) and adding ladybug specific assumptions that would rub the non-ladybug people the wrong way. |
|
dataset addition PR: LadybugDB/dataset#1 |
I will run a benchmark tmrw and attach the results But, what about support for normal parquet tables? We can just remove them from docs until we change the existing classes to normal tables |
I will update the tool and add it in this repo tmrw |
Ladybug-Memory/icebug-format#2 (comment) If the issue is |
Closes #469