Agent-Driven Discovery
Multi-agent system that explores datasets and finds interesting insights.
Coming back with the Dell Pro Max GB10. Plan is to give each persona its own tool set (Statistician gets correlation and regression tools, Detective gets outlier and clustering, Storyteller gets sequence and change-point detection) and make them justify opinions using only their own tool outputs. Larger models on the GB10 remove the 7B reasoning ceiling that capped the original version.
$ cat story.md
I love data. I used to pull random datasets from Kaggle and visualize them just for fun. What if AI agents could do the exploration part?
The idea: point a team of agents at a dataset and let them decide what's interesting. An Explorer proposes insights, a Validator pushes back. The back-and-forth is where the magic happens.
This is a showcase project. I run it locally, publish the interesting results as standalone findings with visualizations of how the agents got there. Practice with multi-agent orchestration, monitoring, and the pandas-to-LLM pattern.
$ cat agent-tools.txt
The Explorer agent has access to these data analysis tools:
Dataset structure: columns, types, row count
Summary statistics: mean, median, std, min, max
Value distribution: histogram or top categories
Pearson correlation between two numeric columns
Filter data with pandas expression, return sample
Statistical outliers using IQR method
Random sample of n rows from the dataset
$ cat exploration-path.txt
Sample exploration run showing the Detective persona analyzing earthquake data. Tool errors (red) are normal - the agent recovers and continues exploring. The Skeptic catches a hallucination mid-run when the Explorer starts discussing "budget and revenue" on earthquake data, forcing a correction.
$ diff personas.txt
Same earthquake dataset, three different personas. The Statistician focuses on distributions and correlations, the Detective hunts for anomalies, and the Storyteller looks for narratives in the data.
Select a persona to view their exploration path
$ ./roundtable --replay
Watch a collaborative exploration unfold. Six personas analyze earthquake data, building on each other's insights through 8 rounds of discussion until reaching consensus. Press play to see their thought process.
$ ls ./components
- -Multi-agent orchestration patterns
- -Agent monitoring and observability
- -Pandas-to-LLM data summarization
- -Critic/validator agent patterns
- -Persona-driven prompt engineering
- -Collaborative consensus mechanisms