This project creates marketing videos by combining OpenAI for script and image generation with ElevenLabs for speech synthesis. Images and audio are assembled into a final MP4 using MoviePy and FFmpeg.
- Node.js 18 or newer
- Python 3.8 or newer
- FFmpeg installed and available in your
PATH
- Install Node dependencies:
npm install
- Create a Python virtual environment and install packages:
python3 -m venv venv source venv/bin/activate # On Windows use `venv\Scripts\activate` pip install -r requirements.txt
- Copy
.env.exampleto.envand provide your API keys.
OPENAI_API_KEY– OpenAI API key used for text and image generation.ELEVENLABS_API_KEY– ElevenLabs API key for text‑to‑speech.BASE_URL– Base URL where the app is served (e.g.http://localhost:3000).
Start the Next.js dev server:
npm run devThen open http://localhost:3000 in your browser.
- Use the web interface to generate voice‑over scripts from a marketing prompt.
- Upload an image to create styled variations via DALL·E.
- Select the desired voice and images, then click Generate Video. The resulting MP4 files are saved under
public/output.