Skip to content

'av' - Audio/Video/Media module #5

@cadebrown

Description

@cadebrown

'mm' should be able to interface (i.e. read, write, and probe) standard multimedia formats, such as images, videos, and audios.

It should use the types present in nx (i.e. tensor0s, nx.array) whenever possible. For example, as soon as you read an image it should be just the raw pixel data, or a tuple of the raw data and relevant metadata

There should be stream writers as well for outputting/writing long streams as well as well as iteration through frames. We should use libav when available and fallbacks for whatever formats we can support without it (but this module won't be super useful when those aren't present)

Types:

  • mm.ImageStream - From a video, yields frames of the video like an iterable
  • mm.AudioStream - from an audio/video, yields chunks of the audio like an iterable

Metadata

Metadata

Assignees

Labels

enhancementNew feature or requestmoduleRelating to a standard module

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions