Skip to content
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
61 commits
Select commit Hold shift + click to select a range
a13da1e
Added tests for loading edges from polars and from fireducks. Added a…
arienandalibi Nov 11, 2025
6afe50e
Adding loading of data (only edges for now) from arrow directly
arienandalibi Nov 16, 2025
c167252
Adding loading of data (only edges for now) from arrow with streaming…
arienandalibi Nov 17, 2025
dc5635d
Added loading of edges from DuckDB, either normally or using streaming.
arienandalibi Nov 18, 2025
7e11387
Added loading edges from fireducks.pandas dataframes. General cleanin…
arienandalibi Nov 18, 2025
2b58f18
Adding flag to stream/not stream data in load_* functions. Will get r…
arienandalibi Nov 19, 2025
eedc1a8
Added functions for load_nodes, load_node_props, load_edges, load_edg…
arienandalibi Nov 21, 2025
a1137db
Cleaned up benchmark print statements
arienandalibi Nov 22, 2025
3d60aa5
Merge branch 'master' into fireducks_polars
arienandalibi Nov 22, 2025
52365af
Ran make stubs
arienandalibi Nov 22, 2025
55bb553
Removed num_rows from DFView. No longer calculating/storing the total…
arienandalibi Nov 24, 2025
c171dff
Cleaned up load_*_from_df functions. load_edge_props/load_node_props …
arienandalibi Nov 24, 2025
6bf05b8
Re-added total number of rows in DFView, but as an Option. We use it …
arienandalibi Nov 25, 2025
228b0f5
Added splitting of large chunks into smaller chunks so that the progr…
arienandalibi Nov 25, 2025
4f428bd
Renamed props to metadata for remaining functions
arienandalibi Nov 25, 2025
39b89ec
Added tests to check equality between graphs created using different …
arienandalibi Nov 25, 2025
7690df5
Changed load_*_metadata_* back to load_*_props_*
arienandalibi Nov 26, 2025
941c7c1
Fixed tests and updated workflow dependencies
arienandalibi Nov 26, 2025
7d25913
Added try-catch blocks for fireducks import in tests
arienandalibi Nov 26, 2025
f72b010
Fixed tests and notebooks
arienandalibi Nov 26, 2025
b38c52a
Fixed invalid function call in test
arienandalibi Nov 26, 2025
62c8ea2
Fixed fireducks package not available on Windows (for now anyway)
arienandalibi Nov 27, 2025
a7bc881
Added load_*_from_df functions to PyPersistentGraph, including load_e…
arienandalibi Nov 27, 2025
1795c18
Merge branch 'master' into fireducks_polars
arienandalibi Nov 27, 2025
925bf83
Merge branch 'master' into fireducks_polars
arienandalibi Nov 28, 2025
cc164a6
Cleaned up load_from_df tests and parametrized them to run for both e…
arienandalibi Nov 28, 2025
8e4e66b
Fixed bug in tests
arienandalibi Nov 28, 2025
446064f
Removed btc dataset benchmarks
arienandalibi Dec 2, 2025
94f2e30
Merge branch 'master' into fireducks_polars
arienandalibi Dec 2, 2025
0ba3e5a
Merge cleanup and fixing python docs errors
arienandalibi Dec 2, 2025
4944687
Adding load_nodes function in python that can take any input from the…
arienandalibi Dec 4, 2025
a6400fa
Fixed CSV reader to calculate column indices for each file separately.
arienandalibi Dec 5, 2025
378c03c
Changed unsafe ArrowArrayStreamReader pointer cast to stream arrow da…
arienandalibi Dec 5, 2025
11a52cf
Added test for loading data from CSV
arienandalibi Dec 5, 2025
ff655e1
Changed CSV reading to avoid loading whole CSV files into memory in a…
arienandalibi Dec 5, 2025
4643c16
Added support for mixed directories containing both CSV and parquet f…
arienandalibi Dec 5, 2025
3d68a99
Added schema argument to load_nodes function
arienandalibi Dec 8, 2025
0a0858e
Fixed load_nodes docs. Added PropType in Python. Added get_dtype_of()…
arienandalibi Dec 9, 2025
94771c9
Merge branch 'fireducks_polars' into consolidate_load_functions
arienandalibi Dec 9, 2025
d2bc247
Fixed casting of columns, can use PropType variants in python to spec…
arienandalibi Dec 9, 2025
deb8cd4
Added casting using pyarrow types as input in the schema
arienandalibi Dec 9, 2025
6944273
Added casting of nested datatypes in the data source. Added test for …
arienandalibi Dec 10, 2025
817d2d0
Added dep:arrow-schema to "python" feature in raphtory-api so that Da…
arienandalibi Dec 11, 2025
62886a3
Added support for dicts as input for the schema. Added equality compa…
arienandalibi Dec 11, 2025
e3013ca
Added CSV options for when loading CSV. Errors if CSV options were pa…
arienandalibi Dec 12, 2025
55e13ab
Added schema support for Parquet and CSV files
arienandalibi Dec 16, 2025
ded1f70
Merge branch 'master' into consolidate_load_functions
arienandalibi Dec 16, 2025
ac02d72
Post merge cleanup
arienandalibi Dec 16, 2025
3b4cdc6
Added test for loading from directories (pure parquet, pure csv, mixe…
arienandalibi Dec 16, 2025
1b01306
Added btc_dataset tests for loading/casting from different sources as…
arienandalibi Dec 17, 2025
3b43e94
Fixed error message displaying incorrectly when the time column is ma…
arienandalibi Dec 18, 2025
e7b91f3
Added malformed parquet test files
arienandalibi Dec 18, 2025
8fbe73a
Fixed CSV loader to return the same error as other loaders when a col…
arienandalibi Dec 18, 2025
53dec1b
Added tests for malformed files
arienandalibi Dec 19, 2025
b961c2a
Added tests for compressed csv files (gz and bz2 compression).
arienandalibi Dec 19, 2025
152ac27
Added test for directory with no CSV/Parquet files
arienandalibi Dec 19, 2025
265732a
Added load functions for edges, node_metadata, edge_metadata, and edg…
arienandalibi Dec 19, 2025
10b41bd
Added pyarrow.DataType import to gen-stubs.py and pyarrow-stubs to de…
arienandalibi Dec 19, 2025
da68b8c
Removed load_*_from_df, load_*_from_pandas, and load_*_from_parquet f…
arienandalibi Dec 20, 2025
c4d7136
Fixed some python tests
arienandalibi Dec 20, 2025
259dcfb
Fixed cast_columns function to not be imported from a python feature …
arienandalibi Dec 20, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions Cargo.lock

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

1 change: 1 addition & 0 deletions Cargo.toml
Original file line number Diff line number Diff line change
Expand Up @@ -162,6 +162,7 @@ parquet = { version = "56.2.0" }
arrow-json = { version = "56.2.0" }
arrow-buffer = { version = "56.2.0" }
arrow-schema = { version = "56.2.0" }
arrow-csv = { version = "56.2.0" }
arrow-array = { version = "56.2.0", features = ["chrono-tz"] }
arrow-cast = { version = "56.2.0" }
arrow-ipc = { version = "56.2.0" }
Expand Down
2 changes: 1 addition & 1 deletion python/pyproject.toml
Original file line number Diff line number Diff line change
Expand Up @@ -33,7 +33,7 @@ pyvis = ["pyvis >= 0.3.2"]
networkx = ["networkx >= 2.6.3"]
export = ["raphtory[pyvis,networkx]"]
all = ["raphtory[export,plot]"]
dev = ["docstring_parser >= 0.16", "pandas-stubs", "maturin>=1.8.3", "tox>=4.25"]
dev = ["docstring_parser >= 0.16", "pandas-stubs", "pyarrow-stubs", "maturin>=1.8.3", "tox>=4.25"]
test = ["raphtory[all]", "requests >= 2.31.0", "pyjwt[crypto] >= 2.10.1", "pytest >= 8", "pytest-benchmark >= 5.1.0", "polars >= 1.35.2", "fireducks; sys_platform != 'win32' and python_version < '3.14'", "duckdb >= 1.4.2"]
tox = ["nbmake"]

Expand Down
Loading
Loading