Distillation-based Layer Dropping (DLD): Effective End-to-end Framework for Dynamic Speech Networks

Paper Link: [arxiv][] [ICASSP][]

Code

Coming soon.!!

Overview

Large Speech Models (LSMs) are effective in transcribing audio data while offering high computational complexity and lacking scalability for different computational budgets. To effectively leverage the capabilities of a LSMs for different computational budgets, methods like early exit, adaptive pruning, and layer dropping are used.

DLD is an end-to-end framework for Automatic Speech Recongition (ASR) that performs knowledge distillation from the teacher network to the dynamic / scalable child network, thus, minimizing the difference between models embeddings and optimizing the performance on all computational budgets.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
images		images
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Distillation-based Layer Dropping (DLD): Effective End-to-end Framework for Dynamic Speech Networks

Code

Overview

Architecture

About

Uh oh!

Releases

Packages

hannabdul/DLD4ASR

Folders and files

Latest commit

History

Repository files navigation

Distillation-based Layer Dropping (DLD): Effective End-to-end Framework for Dynamic Speech Networks

Code

Overview

Architecture

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages