Skip to main content
Modern PyTorch Guide home page
Search...
⌘K
Official Docs
GitHub
GitHub
Search...
Navigation
Audio & Speech
Audio Classification
Foundations
Building Models
Performance
Domains
Production
Advanced
API Reference
Community
Forums
NLP & LLMs
Tokenizers
HuggingFace Transformers
Lightning
Text Classification
Language Modeling
LLM Inference
Nlp from scratch
Attention mechanisms
Computer Vision
TorchVision
timm (PyTorch Image Models)
Image Classification
Object Detection
Image Segmentation
Transfer learning
Adversarial examples
Audio & Speech
TorchAudio
Speech Recognition
Audio Classification
Generative Models
Diffusion Basics
HuggingFace Diffusers
Variational Autoencoders
Generative Adversarial Networks
Dcgan
Reinforcement Learning
TorchRL
RL Environments
Policy Gradient Methods
Deep Q-Networks
Ppo
Mario agent
Recommendation Systems
Torchrec intro
Sharding
Video & Multimodal
Video classification
Action recognition
On this page
Audio Classification
Audio & Speech
Audio Classification
Classifying sounds and music
Audio Classification
Spectrograms, mel features, and audio models.
Speech Recognition
Diffusion Basics
⌘I