Danjie
Wenren
Home
Archive
About
Home
Archive
About
Danjie Wenren
Doing cool stuff
LinkedIn
GitHub
Categories
Research
4
Tags
fsdp
jax
llm
parallel training
parameterization
post-training
pre-training
pytorch
rlvr
scaling law
sharding
tensor parallelism
tips