GraFlo
GraFlo: The Universal Data-to-Graph Transformer
Transform tabular (CSV, SQL) and hierarchical (JSON, XML) data into Property Graphs and Knowledge Graphs for seamless ingestion into major graph databases (Neo4j, ArangoDB, TigerGraph). Ideal for building knowledge graphs from structured data sources and integrating with LLM workflows.
🛠️ Stop the Custom Code. Start Graphing.
Instead of building brittle, custom scripts for every dataset and graph database:
Data → Custom Function → Specific Graph
Embrace a universal standard:
Any Data → GraFlo (Declarative Schema) → Any Graph
Stop writing idiosyncratic functions. Define your graph structure once with GraFlo's declarative Schema, and turn Any Data into Any Graph.
Core Concepts
- Property Graphs: Nodes (vertices) and relationships (edges) with properties and unique identifiers. The standard graph model used by modern graph databases.
- Declarative Schema: Define how your data is transformed into a graph using a declarative configuration. Specify vertex/edge definitions, resource mapping, and transforms without writing custom code.
- Resources: Your data sources—table-like (CSV files, SQL databases) or hierarchical (JSON files, XML documents). GraFlo handles parsing and transformation automatically.
Why GraFlo?
- Universal Standard: One declarative Schema works across all data sources and graph databases
- Knowledge Graph Construction: Build structured knowledge graphs from structured data sources (CSV, SQL, JSON, XML)
- LLM Integration: Transform structured data into knowledge graphs ready for LLM-powered applications and RAG systems
- Multi-Tool Power: Handles parsing CSV/SQL/JSON/XML and defining Vertices, Edges, and Properties
- Database-Agnostic: Seamless ingestion into Neo4j, ArangoDB, and TigerGraph
- Declarative Schema: Define your graph structure once, reuse everywhere
- High Performance: Parallel processing for efficient data transformation
- Advanced Features: Compound indexes, blank vertices, filtering, and server-side filtering support
Installation
pip install graflo
Requires Python 3.11 or higher. Available on PyPI.
Resources
- Full documentation and API reference
- GitHub repository - Source code, issues, and contributions
- Examples - CSV, JSON, XML transformation examples
- PyPI package - Install via pip
Complete Knowledge Graph Pipeline
GraFlo is part of GrowGraph's open source knowledge graph ecosystem. For a complete solution:
- Structured Data? Use GraFlo to transform CSV, SQL, JSON, XML into knowledge graphs
- Unstructured Documents? Use OntoCast - Agentic LLM-powered framework for building knowledge graphs from unstructured documents
Together, GraFlo and OntoCast provide a complete solution for building knowledge graphs from both structured and unstructured data, perfect for LLM-enhanced workflows and agentic optimization.