Skip to main content

Flow Data Sources

Flow Data Sources is a comprehensive repository that automatically aggregates and formats Flow ecosystem content into Markdown files optimized for AI ingestion. This resource serves as a centralized knowledge base for AI tools, chatbots, and RAG (Retrieval-Augmented Generation) pipelines.

Overview

The repository contains Python scripts that:

  • Crawl Flow-related documentation sites, GitHub repositories, and discussions
  • Convert HTML content to Markdown format
  • Extract code examples from GitHub repositories
  • Capture community discussions and Q&A content
  • Merge all content into consolidated files for easy consumption

Key Features

  • Daily Updates: Content is automatically refreshed to ensure the latest information
  • Structured Format: All content is converted to Markdown for consistent processing
  • Comprehensive Coverage: Includes official documentation, code examples, and community discussions
  • Optimized for AI: Designed specifically for AI tools, chatbots, and RAG pipelines
  • Two Output Options:
    • all_merged.md: Complete content including code blocks
    • essentials_merged.md: Streamlined version without code blocks

How to Use

Flow Data Sources can be integrated with:

  • ChatGPT Plugins: Enhance Q&A capabilities with Flow-specific knowledge
  • Custom Chatbots: Power Discord/Telegram bots with accurate Flow information
  • RAG Systems: Index content in vector databases for semantic search
  • Development Tools: Provide context-aware assistance in IDEs like Cursor

Accessing the Content

The merged documentation files are available at:

For integration with AI tools like Cursor or ChatGPT, use the appropriate URL as described in the respective tutorials: