Flow Data Sources
Flow Data Sources is a comprehensive repository that automatically aggregates and formats Flow ecosystem content into Markdown files optimized for AI ingestion. This resource serves as a centralized knowledge base for AI tools, chatbots, and RAG (Retrieval-Augmented Generation) pipelines.
Overview
The repository contains Python scripts that:
- Crawl Flow-related documentation sites, GitHub repositories, and discussions
- Convert HTML content to Markdown format
- Extract code examples from GitHub repositories
- Capture community discussions and Q&A content
- Merge all content into consolidated files for easy consumption
Key Features
- Daily Updates: Content is automatically refreshed to ensure the latest information
- Structured Format: All content is converted to Markdown for consistent processing
- Comprehensive Coverage: Includes official documentation, code examples, and community discussions
- Optimized for AI: Designed specifically for AI tools, chatbots, and RAG pipelines
- Two Output Options:
all_merged.md
: Complete content including code blocksessentials_merged.md
: Streamlined version without code blocks
How to Use
Flow Data Sources can be integrated with:
- ChatGPT Plugins: Enhance Q&A capabilities with Flow-specific knowledge
- Custom Chatbots: Power Discord/Telegram bots with accurate Flow information
- RAG Systems: Index content in vector databases for semantic search
- Development Tools: Provide context-aware assistance in IDEs like Cursor
Accessing the Content
The merged documentation files are available at:
For integration with AI tools like Cursor or ChatGPT, use the appropriate URL as described in the respective tutorials:
- Use Flow Knowledge Base in Cursor - Learn how to set up Cursor with Flow knowledge bases
- Use Flow Knowledge Base in ChatGPT - Create a custom GPT that understands Flow and Cadence