GAIA: A System for Interactive Analysis on Distributed Graphs Using a High-Level Language

Graph data are prevalent

Traversal on Property Graphs

Challenges of Large Graph Traversal at Alibaba

Current State of the Art

Data-Parallel execution of Gremlin

SCOPE Abstraction

Compilation of Control-Flow Constructs

Dynamic Dependency Tracking

Distributed Execution and Optimizations

Implementation and Evaluation



  • Leverage multiple storage layers

    • Now: in-memory store (immutable graph)

    • Production: enterprise feature

      • Dynamic graph for updates

  • Consistency graph

    • Snapshot is good enough

  • Why to pick the language?

    • Users are not developers, are business experts

    • Gremlin: for users to work with the graph

      • Relative easy for target user

Last updated