| 1234567891011121314151617181920212223242526272829303132333435363738394041424344454647484950515253 |
- class AudioMetaData:
- """Return type of ``torchaudio.info`` function.
- This class is used by :ref:`"sox_io" backend<sox_io_backend>` and
- :ref:`"soundfile" backend with the new interface<soundfile_backend>`.
- :ivar int sample_rate: Sample rate
- :ivar int num_frames: The number of frames
- :ivar int num_channels: The number of channels
- :ivar int bits_per_sample: The number of bits per sample. This is 0 for lossy formats,
- or when it cannot be accurately inferred.
- :ivar str encoding: Audio encoding
- The values encoding can take are one of the following:
- * ``PCM_S``: Signed integer linear PCM
- * ``PCM_U``: Unsigned integer linear PCM
- * ``PCM_F``: Floating point linear PCM
- * ``FLAC``: Flac, Free Lossless Audio Codec
- * ``ULAW``: Mu-law
- * ``ALAW``: A-law
- * ``MP3`` : MP3, MPEG-1 Audio Layer III
- * ``VORBIS``: OGG Vorbis
- * ``AMR_WB``: Adaptive Multi-Rate
- * ``AMR_NB``: Adaptive Multi-Rate Wideband
- * ``OPUS``: Opus
- * ``HTK``: Single channel 16-bit PCM
- * ``UNKNOWN`` : None of above
- """
- def __init__(
- self,
- sample_rate: int,
- num_frames: int,
- num_channels: int,
- bits_per_sample: int,
- encoding: str,
- ):
- self.sample_rate = sample_rate
- self.num_frames = num_frames
- self.num_channels = num_channels
- self.bits_per_sample = bits_per_sample
- self.encoding = encoding
- def __str__(self):
- return (
- f"AudioMetaData("
- f"sample_rate={self.sample_rate}, "
- f"num_frames={self.num_frames}, "
- f"num_channels={self.num_channels}, "
- f"bits_per_sample={self.bits_per_sample}, "
- f"encoding={self.encoding}"
- f")"
- )
|