Refactor C++ SingleStreamDecoder to consolidate addStream into constructor

Right now, the C++ `SingleStreamDecoder` class has two constructors: https://github.com/meta-pytorch/torchcodec/blob/1ea235aa6b71bc9d6ce070d4b849dc60974082b5/src/torchcodec/_core/SingleStreamDecoder.h#L34-L45

And separate from there, there is a public API for adding streams of different media types: https://github.com/meta-pytorch/torchcodec/blob/1ea235aa6b71bc9d6ce070d4b849dc60974082b5/src/torchcodec/_core/SingleStreamDecoder.h#L90-L97

Note that there is also a private member function that both of those call: https://github.com/meta-pytorch/torchcodec/blob/1ea235aa6b71bc9d6ce070d4b849dc60974082b5/src/torchcodec/_core/SingleStreamDecoder.h#L304-L309

This separation is a relic of when `SingleStreamDecoder` was trying to be more than just a single stream decoder. This API does not match the public Python API, and that fact causes some awkwardness, in particular with how we deal with custom frame mappings; see PR https://github.com/meta-pytorch/torchcodec/pull/1060#discussion_r2541371586 for more.

Note that one could imagine the current separation enables just getting metadata without adding a decoding stream, but that should be well supported with `SeekMode::approximate`.

	// Creates a SingleStreamDecoder from the video at videoFilePath.
	explicit SingleStreamDecoder(
	const std::string& videoFilePath,
	SeekMode seekMode = SeekMode::exact);

	// Creates a SingleStreamDecoder using the provided AVIOContext inside the
	// AVIOContextHolder. The AVIOContextHolder is the base class, and the
	// derived class will have specialized how the custom read, seek and writes
	// work.
	explicit SingleStreamDecoder(
	std::unique_ptr<AVIOContextHolder> context,
	SeekMode seekMode = SeekMode::exact);

	void addVideoStream(
	int streamIndex,
	std::vector<Transform*>& transforms,
	const VideoStreamOptions& videoStreamOptions = VideoStreamOptions(),
	std::optional<FrameMappings> customFrameMappings = std::nullopt);
	void addAudioStream(
	int streamIndex,
	const AudioStreamOptions& audioStreamOptions = AudioStreamOptions());

	void addStream(
	int streamIndex,
	AVMediaType mediaType,
	const torch::Device& device = torch::kCPU,
	const std::string_view deviceVariant = "ffmpeg",
	std::optional<int> ffmpegThreadCount = std::nullopt);

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refactor C++ SingleStreamDecoder to consolidate addStream into constructor #1064

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Refactor C++ SingleStreamDecoder to consolidate addStream into constructor #1064

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions