AI Image Generator with Voice Input

A modern Next.js application that generates AI images using FAL.AI's image generation API and supports voice input through Deepgram's speech-to-text API.

Features

🎨 AI Image Generation using FAL.AI's flux/dev model
🎤 Voice-to-Text input using Deepgram
⚡ Real-time image generation with status updates
🎯 High-definition square image output
🎨 Modern UI with Tailwind CSS
🔒 Secure API key handling through proxy routes

Demo

[Add screenshots or GIF here]

Tech Stack

Next.js 14 (App Router)
TypeScript
Tailwind CSS
FAL.AI Client
Deepgram SDK
React Hooks

Getting Started

Prerequisites

Node.js 18+
npm or yarn
FAL.AI API key (Get one here)
Deepgram API key (Get one here)

Installation

Clone the repository:

git clone https://github.com/yourusername/ai-image-generator.git
cd ai-image-generator

Install dependencies:

npm install
# or
yarn install

Create a .env.local file in the root directory:

FAL_KEY=your_fal_ai_key_here
DEEPGRAM_API_KEY=your_deepgram_key_here

Start the development server:

npm run dev
# or
yarn dev

Open http://localhost:3000 in your browser.

Usage

Enter a text prompt describing the image you want to generate, or click the microphone button to use voice input.
If using voice input, speak your prompt and click the button again to stop recording.
Click "Generate" to create your image.
Wait for the image to be generated (usually takes 10-15 seconds).
The generated image will appear below the input field.

Project Structure

src/
├── app/
│   ├── api/
│   │   ├── fal/
│   │   │   └── proxy/
│   │   │       └── route.ts    # FAL.AI proxy route
│   │   │
│   │   └── components/
│   │       └── ImageGenerator.tsx  # Main component
│   │
│   ├── lib/
│   │   └── contexts/
│   │       └── DeepgramContext.tsx  # Deepgram context
│   │
│   ├── layout.tsx
│   └── page.tsx
│
├── .env.local.example
└── package.json

Environment Variables

Create a .env.local file with the following variables:

FAL_KEY=your_fal_ai_key_here
DEEPGRAM_API_KEY=your_deepgram_key_here

Contributing

Fork the repository
Create your feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

FAL.AI for their amazing image generation API
Deepgram for their speech-to-text API
Next.js team for the awesome framework
Tailwind CSS for the styling utilities

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
paths		paths
public		public
src		src
.cursorrules		.cursorrules
.env.local.example		.env.local.example
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
.replit		.replit
LICENSE		LICENSE
README.md		README.md
next.config.mjs		next.config.mjs
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tailwind.config.ts		tailwind.config.ts
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Image Generator with Voice Input

Features

Demo

Tech Stack

Getting Started

Prerequisites

Installation

Usage

Project Structure

Environment Variables

Contributing

License

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI Image Generator with Voice Input

Features

Demo

Tech Stack

Getting Started

Prerequisites

Installation

Usage

Project Structure

Environment Variables

Contributing

License

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages