A modern, interactive demo website for the MAGE (Multi-scale Audio Generation Enhancement) speech enhancement research paper.
- Interactive Audio Demo: Compare noisy input, enhanced output, and clean reference audio
- Multiple Noise Conditions: Test with cafe, traffic, office, babble, and white noise
- Performance Metrics: Real-time display of PESQ and STOI scores
- Responsive Design: Optimized for desktop, tablet, and mobile devices
- Real-time Visualizations: Waveform and spectrogram displays
- Academic Integration: Easy citation copying and paper links
- Fork this repository
- Add your audio samples to the
audio/samples/directory - Enable GitHub Pages in repository settings
- Your demo will be live at
https://yourusername.github.io/MAGE
# Clone the repository
git clone https://github.com/hieugiaosu/MAGE.git
cd MAGE
# Serve locally (Python 3)
python -m http.server 8000
# Or with Node.js
npx serve .
# Or with PHP
php -S localhost:8000Visit http://localhost:8000 to view the demo.
MAGE/
βββ index.html # Main HTML file
βββ styles.css # CSS styling
βββ script.js # JavaScript functionality
βββ README.md # This file
βββ audio/
β βββ README.md # Audio directory documentation
β βββ samples/ # Audio sample files
β βββ clean_reference.wav
β βββ noisy_cafe_0db.wav
β βββ enhanced_cafe_0db.wav
β βββ ... (more samples)
βββ docs/ # Additional documentation (optional)
Place your audio samples in the audio/samples/ directory with this naming convention:
clean_reference.wav # Original clean speech
noisy_{noise_type}_{snr}db.wav # Noisy inputs
enhanced_{noise_type}_{snr}db.wav # Enhanced outputs
- Noise Types: cafe, traffic, office, babble, white
- SNR Levels: 0db, 5db, 10db, 15db
- Format: WAV files, 16kHz sample rate, mono channel
audio/samples/
βββ clean_reference.wav
βββ noisy_cafe_0db.wav
βββ enhanced_cafe_0db.wav
βββ noisy_cafe_5db.wav
βββ enhanced_cafe_5db.wav
βββ noisy_traffic_0db.wav
βββ enhanced_traffic_0db.wav
βββ ... (continue pattern for all noise types and SNR levels)
Edit the following sections in index.html:
- Title and Meta Tags (lines 5-10)
- Hero Section (lines 45-65)
- Abstract Content (lines 85-105)
- Paper Details (lines 280-300)
- Citation Information (lines 310-320)
Update the metrics in script.js:
// Performance metrics for different conditions
const performanceMetrics = {
'cafe': {
'0db': { pesq: 1.8, snr: 18 },
'5db': { pesq: 2.1, snr: 20 },
// ... add your actual metrics
},
// ... other noise types
};Customize the appearance by modifying styles.css:
- Colors: Update CSS custom properties at the top
- Fonts: Change the Google Fonts import
- Layout: Modify grid and flexbox configurations
The demo includes comparison with state-of-the-art methods:
| Method | PESQ β | STOI β | SDR β |
|---|---|---|---|
| Noisy | 1.97 | 0.78 | 8.5 |
| Wiener Filter | 2.12 | 0.82 | 10.2 |
| RNNoise | 2.45 | 0.86 | 12.8 |
| DeepNoise | 2.67 | 0.89 | 14.3 |
| MAGE (Ours) | 3.08 | 0.94 | 17.2 |
- Audio Players: Synchronized playback controls
- Demo Controls: Dynamic sample switching
- Performance Charts: Real-time metric visualization
- Responsive Navigation: Mobile-friendly menu
- Smooth Animations: Scroll-triggered effects
- Waveform Display: Animated comparison visualization
- Spectrograms: Real-time frequency domain representation
- Performance Charts: Interactive metric comparisons
- Mobile Optimization: Touch-friendly interface
- Chrome 70+
- Firefox 65+
- Safari 12+
- Edge 79+
- Any static file server (GitHub Pages, Netlify, Vercel, etc.)
- No backend processing required
- HTTPS recommended for audio playback
- Push code to GitHub repository
- Go to Settings β Pages
- Select source branch (usually
main) - Your site will be available at
https://username.github.io/repository-name
- Connect your GitHub repository
- Set build command: (none needed)
- Set publish directory:
/ - Deploy automatically on commits
- Import your GitHub repository
- Framework preset: Other
- Build command: (none needed)
- Output directory:
./
If you use this demo website for your research, please cite:
@inproceedings{yourname2025mage,
title={MAGE: Multi-scale Audio Generation Enhancement for Speech Enhancement},
author={Your Name and Co-Author Name},
booktitle={ICASSP 2025},
year={2025}
}This project is licensed under the MIT License - see the LICENSE file for details.
Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.
- Fork the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature) - Commit your changes (
git commit -m 'Add some AmazingFeature') - Push to the branch (
git push origin feature/AmazingFeature) - Open a Pull Request
- Author: Your Name
- Email: your.email@university.edu
- Project Link: https://github.com/hieugiaosu/MAGE
- Demo Link: https://hieugiaosu.github.io/MAGE
- Built with vanilla HTML, CSS, and JavaScript
- Icons from Font Awesome
- Fonts from Google Fonts
- Inspired by modern academic research websites
Note: This is a template for a speech enhancement demo website. Replace the placeholder content with your actual research data, audio samples, and performance metrics.