mirror of
https://github.com/frankwxu/mobile-pii-discovery-agent.git
synced 2026-04-10 12:13:44 +00:00
6dfdb6d7de73ecac2a289fd2fa4e88d302ae3412
LLM-Guided SQL Evidence Extraction
This project implements a lightweight LLM-assisted pipeline for discovering and extracting evidentiary artifacts from SQLite databases commonly found in mobile device extractions.
The system separates discovery and extraction to reduce search space, avoid hallucinated SQL, and preserve explainability.
Features
- LLM-guided SQL planning with deterministic execution
- Discovery to extraction workflow
- Fixed evidence types:
EMAIL,PHONE,USERNAME,PERSON_NAME - Safe SQLite execution with REGEXP support
- UNION / UNION ALL–aware column extraction
- Transparent, inspectable state machine
Setup
pip install langchain langgraph python-dotenv
Languages
Jupyter Notebook
85.5%
Python
12.4%
TeX
2.1%