Activity
add the proposed jumbo vit from Fuller et al. of Carleton University
Force push
add the proposed jumbo vit from Fuller et al. of Carleton University
Force push
add the proposed jumbo vit from Fuller et al. of Carleton University
add a simple vit flavor for a new bytedance paper that proposes to br…
Force push
add a simple vit flavor for a new bytedance paper that proposes to br…
allow for qk norm to be turned off for na vit nested tensor
update minimum version for nested tensor of NaViT
add value residual based simple vit
fix multiheaded qk rmsnorm in nViT
go all the way with the normalized vit, fix some scales
Force push
go all the way with the normalized vit, fix some scales
cite for hypersphere vit adapted from ngpt
go for multi-headed rmsnorm for the qknorm on hypersphere vit
add register tokens to the nested tensor 3d na vit example for resear…