This hack explores:

- Populating vertices and edges in a graph database from an event source
- Using that graph to power a more personalised LLM assistant, built with Microsoft Agent Framework
- Extracting preferences from the user chat and persisting them back to the graph for future use

It makes use of the anonymised H&M dataset, available on Kaggle.
**user_agent**

- Looks up user purchases from the graph
- Recommends items based on customer segment queries
- Finds similar items to existing ones
- Calls preference extraction as an agent inside a memory layer
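The purchase lookup tool could be sketched as a small query builder. The vertex label `customer`, edge label `purchased`, and label `article` below are illustrative assumptions, not the repo's actual graph schema:

```python
def purchases_query(customer_id: str, limit: int = 10) -> str:
    """Build a Gremlin query fetching a customer's purchased articles.

    NOTE: the labels 'customer', 'purchased' and 'article' are assumptions
    for this sketch -- check the real schema built by the ingestion pipeline.
    """
    # Crude guard against string injection into the traversal.
    if not customer_id.isalnum():
        raise ValueError("unexpected characters in customer id")
    return (
        f"g.V().has('customer', 'id', '{customer_id}')"
        f".out('purchased').hasLabel('article')"
        f".valueMap(true).limit({limit})"
    )
```

In practice the agent tool would submit this string through the Gremlin client against the Cosmos DB graph endpoint and return the results to the model.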
**dynamic_query_agent**

An experiment to test handing the agent the graph schema and a user query, and dynamically generating a Gremlin query. Not safe, there be dragons here :)
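Since the generated Gremlin is executed as-is, one cheap guard (a sketch, not something the repo necessarily implements) is a deny-list that rejects traversals containing mutating steps before they reach the graph:

```python
import re

# Gremlin steps that mutate the graph -- a generated read query should
# contain none of these. A deny-list is a coarse guard, not real sandboxing.
MUTATING_STEPS = ("addV", "addE", "drop", "property", "mergeV", "mergeE")

def is_read_only(query: str) -> bool:
    """Return True if the query contains no known mutating step call."""
    return not any(
        re.search(rf"\b{step}\s*\(", query) for step in MUTATING_STEPS
    )
```

This only catches the obvious cases; a safer design would execute generated queries against a read-only connection.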
**signals_extraction_agent**

Pass the conversation to this agent along with a model schema for preferences, and have it extract those customer preferences from the chat.
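The preference schema might look something like the dataclass below; the field names are illustrative assumptions, not the repo's actual model. Merging newly extracted values into the stored ones keeps repeat extractions from duplicating entries:

```python
from dataclasses import dataclass, field

@dataclass
class CustomerPreferences:
    """Illustrative preference schema -- field names are assumptions."""
    favourite_colours: list[str] = field(default_factory=list)
    preferred_categories: list[str] = field(default_factory=list)
    disliked_materials: list[str] = field(default_factory=list)

def merge(existing: CustomerPreferences,
          extracted: CustomerPreferences) -> CustomerPreferences:
    """Union newly extracted preferences into the stored ones, keeping order."""
    def union(a: list[str], b: list[str]) -> list[str]:
        return a + [x for x in b if x not in a]
    return CustomerPreferences(
        favourite_colours=union(existing.favourite_colours,
                                extracted.favourite_colours),
        preferred_categories=union(existing.preferred_categories,
                                   extracted.preferred_categories),
        disliked_materials=union(existing.disliked_materials,
                                 extracted.disliked_materials),
    )
```

The merged result is what would be persisted back to the graph for the next conversation.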
You can chat to the agents in the super handy Dev UI extension for Agent Framework.
Repo structure:

    ├── infra/           # Terraform infrastructure definitions
    ├── src/
    │   ├── agents/      # Agents, tools & memory
    │   └── ingestion/   # Data pipeline
    │       ├── ingest/  # CSV → Event Hub producer
    │       ├── consume/ # Event Hub consumer → Cosmos DB
    │       └── models/  # Event data models
    └── tests/           # Unit, integration and AI evals
**Clone the repository**

    git clone https://github.com/damoodamoo/graph_chat.git
    cd graph_chat
**Open in dev container**

- Open the project in VS Code
- When prompted, click "Reopen in Container", or use the Command Palette: `Dev Containers: Reopen in Container`
**Authenticate with Azure**

    az login
**Configure environment variables**

Create a `.env` file in the project root:

    cp .env.sample .env
**Deploy infrastructure**

    task infra:deploy

Deploying the infra creates the `app.env` file that the ingestion and chat agents need.
**Download and reduce the source data from Kaggle**

- Get your Kaggle API key and set it in `.env`
- Download the dataset:

        task data:download

Now you have the full (and reduced) dataset downloaded.
**Run the 'ingest' and 'consume' flows to populate the graph**

In one terminal session, start the consumer...

    task consume:all

...and in another session, start the ingest (producer):

    task ingest:all
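Under the hood this is a classic producer/consumer split: the ingest side turns CSV rows into events on Event Hub, and the consume side writes them into Cosmos DB as vertices and edges. A minimal sketch of the producer-side transform, using the column names from the H&M transactions CSV (the event shape itself is an assumption, not the repo's actual model):

```python
import json

def row_to_event(row: dict) -> bytes:
    """Serialise one transactions CSV row into a purchase event payload.

    Column names (t_dat, customer_id, article_id, price) follow the H&M
    transactions CSV; the event envelope here is illustrative only.
    """
    event = {
        "type": "purchase",
        "customer_id": row["customer_id"],
        "article_id": row["article_id"],
        "price": float(row["price"]),  # CSV values arrive as strings
        "date": row["t_dat"],
    }
    return json.dumps(event).encode("utf-8")
```

The real producer would batch these payloads and send them with the Event Hubs client; the consumer would map each one to a `purchased` edge between customer and article vertices.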
**Run the agent**

    task agent:dev

...then ask it some stuff.
**Run the evals**

Currently there is an initial set of evals for preference extraction:

    task eval

Other unit / integration tests are run via `task test`.
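A preference-extraction eval can be as simple as set overlap between expected and extracted values; this scoring sketch (not the repo's actual eval harness) computes an F1 score per conversation:

```python
def extraction_f1(expected: set[str], extracted: set[str]) -> float:
    """F1 between expected and extracted preference values."""
    if not expected and not extracted:
        return 1.0  # nothing to extract, nothing extracted
    tp = len(expected & extracted)
    if tp == 0:
        return 0.0
    precision = tp / len(extracted)
    recall = tp / len(expected)
    return 2 * precision * recall / (precision + recall)
```

Averaging this across a labelled set of conversations gives a single number to track as the extraction prompt or schema evolves.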
This is an experimental project and is intended for upskilling.


