-
Notifications
You must be signed in to change notification settings - Fork 2
Open
Labels
enhancementNew feature or requestNew feature or requestmoduleRelating to a standard moduleRelating to a standard module
Description
'mm' should be able to interface (i.e. read, write, and probe) standard multimedia formats, such as images, videos, and audios.
It should use the types present in nx (i.e. tensor0s, nx.array) whenever possible. For example, as soon as you read an image it should be just the raw pixel data, or a tuple of the raw data and relevant metadata
There should be stream writers as well for outputting/writing long streams as well as well as iteration through frames. We should use libav when available and fallbacks for whatever formats we can support without it (but this module won't be super useful when those aren't present)
Types:
mm.ImageStream- From a video, yields frames of the video like an iterablemm.AudioStream- from an audio/video, yields chunks of the audio like an iterable
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or requestmoduleRelating to a standard moduleRelating to a standard module