MAGE: Speech Enhancement Demo Website

A modern, interactive demo website for the MAGE (Multi-scale Audio Generation Enhancement) speech enhancement research paper.

🎯 Features

Interactive Audio Demo: Compare noisy input, enhanced output, and clean reference audio
Multiple Noise Conditions: Test with cafe, traffic, office, babble, and white noise
Performance Metrics: Real-time display of PESQ and STOI scores
Responsive Design: Optimized for desktop, tablet, and mobile devices
Real-time Visualizations: Waveform and spectrogram displays
Academic Integration: Easy citation copying and paper links

🚀 Quick Start

Option 1: GitHub Pages (Recommended)

Fork this repository
Add your audio samples to the audio/samples/ directory
Enable GitHub Pages in repository settings
Your demo will be live at https://yourusername.github.io/MAGE

Option 2: Local Development

# Clone the repository
git clone https://github.com/hieugiaosu/MAGE.git
cd MAGE

# Serve locally (Python 3)
python -m http.server 8000

# Or with Node.js
npx serve .

# Or with PHP
php -S localhost:8000

Visit http://localhost:8000 to view the demo.

📁 Project Structure

MAGE/
├── index.html              # Main HTML file
├── styles.css              # CSS styling
├── script.js               # JavaScript functionality
├── README.md               # This file
├── audio/
│   ├── README.md           # Audio directory documentation
│   └── samples/            # Audio sample files
│       ├── clean_reference.wav
│       ├── noisy_cafe_0db.wav
│       ├── enhanced_cafe_0db.wav
│       └── ... (more samples)
└── docs/                   # Additional documentation (optional)

🎵 Audio Sample Setup

Required Audio Files

Place your audio samples in the audio/samples/ directory with this naming convention:

clean_reference.wav                 # Original clean speech
noisy_{noise_type}_{snr}db.wav     # Noisy inputs
enhanced_{noise_type}_{snr}db.wav  # Enhanced outputs

Supported Configurations

Noise Types: cafe, traffic, office, babble, white
SNR Levels: 0db, 5db, 10db, 15db
Format: WAV files, 16kHz sample rate, mono channel

Example Files Needed

audio/samples/
├── clean_reference.wav
├── noisy_cafe_0db.wav
├── enhanced_cafe_0db.wav
├── noisy_cafe_5db.wav
├── enhanced_cafe_5db.wav
├── noisy_traffic_0db.wav
├── enhanced_traffic_0db.wav
└── ... (continue pattern for all noise types and SNR levels)

🛠️ Customization

Updating Paper Information

Edit the following sections in index.html:

Title and Meta Tags (lines 5-10)
Hero Section (lines 45-65)
Abstract Content (lines 85-105)
Paper Details (lines 280-300)
Citation Information (lines 310-320)

Modifying Performance Metrics

Update the metrics in script.js:

// Performance metrics for different conditions
const performanceMetrics = {
    'cafe': { 
        '0db': { pesq: 1.8, snr: 18 }, 
        '5db': { pesq: 2.1, snr: 20 },
        // ... add your actual metrics
    },
    // ... other noise types
};

Styling Changes

Customize the appearance by modifying styles.css:

Colors: Update CSS custom properties at the top
Fonts: Change the Google Fonts import
Layout: Modify grid and flexbox configurations

📊 Performance Comparison

The demo includes comparison with state-of-the-art methods:

Method	PESQ ↑	STOI ↑	SDR ↑
Noisy	1.97	0.78	8.5
Wiener Filter	2.12	0.82	10.2
RNNoise	2.45	0.86	12.8
DeepNoise	2.67	0.89	14.3
MAGE (Ours)	3.08	0.94	17.2

🎨 Features Overview

Interactive Components

Audio Players: Synchronized playback controls
Demo Controls: Dynamic sample switching
Performance Charts: Real-time metric visualization
Responsive Navigation: Mobile-friendly menu
Smooth Animations: Scroll-triggered effects

Visual Elements

Waveform Display: Animated comparison visualization
Spectrograms: Real-time frequency domain representation
Performance Charts: Interactive metric comparisons
Mobile Optimization: Touch-friendly interface

🔧 Technical Requirements

Browser Support

Chrome 70+
Firefox 65+
Safari 12+
Edge 79+

Server Requirements

Any static file server (GitHub Pages, Netlify, Vercel, etc.)
No backend processing required
HTTPS recommended for audio playback

📝 Deployment Options

GitHub Pages

Push code to GitHub repository
Go to Settings → Pages
Select source branch (usually main)
Your site will be available at https://username.github.io/repository-name

Netlify

Connect your GitHub repository
Set build command: (none needed)
Set publish directory: /
Deploy automatically on commits

Vercel

Import your GitHub repository
Framework preset: Other
Build command: (none needed)
Output directory: ./

🎓 Citation

If you use this demo website for your research, please cite:

@inproceedings{yourname2025mage,
  title={MAGE: Multi-scale Audio Generation Enhancement for Speech Enhancement},
  author={Your Name and Co-Author Name},
  booktitle={ICASSP 2025},
  year={2025}
}

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.

Development Setup

Fork the repository
Create your feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

📞 Contact

Author: Your Name
Email: your.email@university.edu
Project Link: https://github.com/hieugiaosu/MAGE
Demo Link: https://hieugiaosu.github.io/MAGE

🙏 Acknowledgments

Built with vanilla HTML, CSS, and JavaScript
Icons from Font Awesome
Fonts from Google Fonts
Inspired by modern academic research websites

Note: This is a template for a speech enhancement demo website. Replace the placeholder content with your actual research data, audio samples, and performance metrics.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github/workflows		.github/workflows
sample		sample
.DS_Store		.DS_Store
README.md		README.md
coming-soon.html		coming-soon.html
favicon.png		favicon.png
index.html		index.html
instructions.md		instructions.md
script.js		script.js
styles.css		styles.css

Folders and files

Latest commit

History

Repository files navigation

MAGE: Speech Enhancement Demo Website

🎯 Features

🚀 Quick Start

Option 1: GitHub Pages (Recommended)

Option 2: Local Development

📁 Project Structure

🎵 Audio Sample Setup

Required Audio Files

Supported Configurations

Example Files Needed

🛠️ Customization

Updating Paper Information

Modifying Performance Metrics

Styling Changes

📊 Performance Comparison

🎨 Features Overview

Interactive Components

Visual Elements

🔧 Technical Requirements

Browser Support

Server Requirements

📝 Deployment Options

GitHub Pages

Netlify

Vercel

🎓 Citation

📄 License

🤝 Contributing

Development Setup

📞 Contact

🙏 Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages