mirror of
https://github.com/frankwxu/mobile-pii-discovery-agent.git
synced 2026-04-10 12:13:44 +00:00
abb92e0369cb310514655ff773940cdd85724dd7
LLM-Guided SQL Evidence Extraction
This project implements a lightweight LLM-assisted pipeline for discovering and extracting evidentiary artifacts from SQLite databases commonly found in mobile device extractions.
The system separates discovery and extraction to reduce search space, avoid hallucinated SQL, and preserve explainability.
Features
- LLM-guided SQL planning with deterministic execution
- Discovery to extraction workflow
- Fixed evidence types:
EMAIL,PHONE,USERNAME,PERSON_NAME - Safe SQLite execution with REGEXP support
- UNION / UNION ALL–aware column extraction
- Transparent, inspectable state machine
Setup
pip install langchain langgraph python-dotenv
Languages
Jupyter Notebook
85.5%
Python
12.4%
TeX
2.1%