Can we add the ability to use TransNet V2 as the scene recognition engine, and have multiple interface languages, such as Chinese?
Can we add the ability to use TransNet V2 as the scene recognition engine, and have multiple interface languages, such as Chinese?