Question regarding Cross-View Reconstruction

Hi, thanks for your excellent work!

I have a minor clarification question regarding the Cross-View Reconstruction setting.
Given a scene with multiple captured views, when one view is masked during training and the model is required to reconstruct it, how does the model identify which specific viewpoint should be reconstructed?

In particular, does the model rely on any additional information (e.g., camera pose, intrinsic parameters, view indices, or positional embeddings) to disambiguate the target view, or is this implicitly inferred from the input representation?

Thanks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question regarding Cross-View Reconstruction #19

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Question regarding Cross-View Reconstruction #19

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions