    Repositories list

    • Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities.
      Python
      MIT License
      1801.6k280 Updated Jan 16, 2025
    • mini-omni

      Public
      An open-source multimodal large language model that can hear and talk while thinking, featuring real-time, end-to-end speech input and streaming audio output for conversation.
      Python
      MIT License
      2703.1k313 Updated Nov 5, 2024