Pebblo Documentation
OpenSource Safe DataLoader for Gen AI applications
Project maintained by daxa-ai
Overview
Pebblo enables developers to safely load data and promote their Gen AI app to deployment without worrying about the organization’s compliance and security requirements. The project identifies semantic topics and entities found in the loaded data and summarizes them in the UI or in a PDF report.
Benefits
- Identify semantic topics and entities in your data loaded in RAG applications
- Accelerate time-to-production by effortlessly meeting your organization’s data compliance requirements
- Mitigate security risks arising from data poisoning and emerging threats
- Comply with regulations such as the EU AI Act with custom reports and data records
- Work with a wide range of Gen AI development frameworks and data loaders
Components
Pebblo has two components:
- Pebblo Daemon - a REST API application with a topic classifier, an entity classifier, and reporting
- Pebblo Safe DataLoader - a thin wrapper around a Gen AI framework’s data loaders
Pebblo Safe DataLoader currently supports the LangChain framework. Support for other frameworks, such as LlamaIndex and Haystack, will be added in upcoming releases.
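To make the "thin wrapper" idea concrete, here is a minimal, self-contained sketch of the pattern in plain Python. It is not Pebblo's actual API: `Document`, `ListLoader`, `SafeLoaderSketch`, and the `report` callback are hypothetical names invented for illustration, and the callback merely stands in for whatever request a real safe loader would send to the daemon. The point is only that the wrapper delegates loading to the framework's loader, hands a copy of what was loaded to a classifier service, and returns the documents unchanged.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Protocol


@dataclass
class Document:
    """Minimal stand-in for a framework's document type (hypothetical)."""
    page_content: str
    metadata: Dict[str, str] = field(default_factory=dict)


class Loader(Protocol):
    """Anything with a load() method that returns documents."""
    def load(self) -> List[Document]: ...


class ListLoader:
    """Toy loader that serves documents from an in-memory list of strings."""

    def __init__(self, texts: List[str]) -> None:
        self._texts = list(texts)

    def load(self) -> List[Document]:
        return [
            Document(text, {"source": f"memory:{i}"})
            for i, text in enumerate(self._texts)
        ]


class SafeLoaderSketch:
    """Hypothetical thin wrapper: delegate loading, then report what was loaded."""

    def __init__(self, loader: Loader, app_name: str,
                 report: Callable[[dict], None]) -> None:
        self._loader = loader
        self._app_name = app_name
        self._report = report  # stand-in for a request to a daemon (assumption)

    def load(self) -> List[Document]:
        docs = self._loader.load()  # the wrapped loader does the real work
        payload = {
            "app": self._app_name,
            "docs": [
                {"source": d.metadata.get("source", ""), "content": d.page_content}
                for d in docs
            ],
        }
        self._report(payload)  # a classifier service would inspect this payload
        return docs  # callers receive the documents unchanged


# Usage: collect reports in a list instead of sending them over the network.
reports: List[dict] = []
safe_loader = SafeLoaderSketch(ListLoader(["first note", "second note"]),
                               app_name="demo-app", report=reports.append)
documents = safe_loader.load()
print(len(documents), "documents loaded;", len(reports), "report sent")
```

Because the wrapper exposes the same `load()` method as the loader it wraps, application code that already calls `load()` would not need to change, which is what keeps a wrapper of this kind thin.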
Documentation