Ali NT
AliNT99
AI & ML interests
None yet
Recent Activity
commented on a paper 18 days ago
Progressive Residual Warmup for Language Model Pretraining
published a model about 1 month ago
AliNT99/Flash_attn2_2.8.3_cu128_sm120_cp312_cu128_torch210_wheel
upvoted an article 5 months ago
ZeRO Optimization Strategies for Large-Scale Model Training - A brief Performance Analysis
Organizations
None yet