forked from NVIDIA/Megatron-LM
From NVIDIA Megatron-LM for visibility #18
Open: RaymondLi0 wants to merge 4,790 commits into bigcode-project:multi-query-attention from NVIDIA:main
+230,452 −27,996
Conversation
- ci: onboard T5 memory test (See merge request ADLR/megatron-lm!3225)
- ci: Provide easier tooling for local runs (See merge request ADLR/megatron-lm!3257)
- Remove unintentionally leftover lines in ModelOpt Linear layer (See merge request ADLR/megatron-lm!3228)
- feat: use multi-storage client in checkpointing (See merge request ADLR/megatron-lm!2652)
- ci: Fixes to the release (See merge request ADLR/megatron-lm!3263)
- ADLR/megatron-lm!3193 - substitute nemo1 tests with nemo2 tests (See merge request ADLR/megatron-lm!3235)
- remove from recipe (See merge request ADLR/megatron-lm!3270)
  Co-authored-by: Mcore Bot <[email protected]>, Hao Wu <[email protected]>
- Fix attention_mask shapes in Attention unit test. Closes #464 (See merge request ADLR/megatron-lm!3261)
- Updated setup instructions in README.md (See merge request ADLR/megatron-lm!3210)
  Co-authored-by: Santosh Bhavani <[email protected]>
- Disable cudagraphs when pipeline parallel microbatched inference is on (See merge request ADLR/megatron-lm!3151)
- Inference functional test: 580M Minitron (See merge request ADLR/megatron-lm!2812)
  Co-authored-by: oliver könig <[email protected]>, Mcore Bot <[email protected]>
- …ron" This reverts commit f8c8c9c.
- Invalidate cached SSM tensors if batch size changes during inference (See merge request ADLR/megatron-lm!3277)
  Co-authored-by: oliver könig <[email protected]>, Mcore Bot <[email protected]>
- ci: Move unit test logic to file (See merge request ADLR/megatron-lm!3291)
- Mark weights from vision encoder to be non-tensor-parallelizable to ensure gradients are correctly all-reduced (See merge request ADLR/megatron-lm!3190)
  Co-authored-by: root <[email protected]>, William Dykas <[email protected]>
- Granular upcycling implementation (See merge request ADLR/megatron-lm!2850)
  Co-authored-by: Zijie Yan <[email protected]>
- Add GPU energy (and ~power) monitoring for training (See merge request ADLR/megatron-lm!3424)
- feat(MoE): Support ep a2a overlap - (01) Add TransformerLayer Submodule Callables (See merge request ADLR/megatron-lm!3217)
  Co-authored-by: Zijie Yan <[email protected]>
- build: Switch to uv (See merge request ADLR/megatron-lm!3397)
  Co-authored-by: Peter Dykas <[email protected]>, Hongxiao Bai <[email protected]>, Santosh Bhavani <[email protected]>, Qiyu Wan <[email protected]>, Duncan Riach <[email protected]>, Guyue Huang <[email protected]>, Kezhi Kong <[email protected]>, Li Tao <[email protected]>, Tyler Poon <[email protected]>, Yu Yao <[email protected]>, Helen Ngo <[email protected]>, Mikolaj Blaz <[email protected]>, Kunlun Li <[email protected]>, Shunkang Zhang <[email protected]>, Jakub Szulc <[email protected]>, Keshav Santhanam <[email protected]>, Matthieu Le <[email protected]>, Abhinav Khattar <[email protected]>, Selvaraj Anandaraj <[email protected]>, Mcore Bot <[email protected]>
- build: Simplify nemo image (See merge request ADLR/megatron-lm!3468)
- Make completions endpoint use MCore inference engine (See merge request ADLR/megatron-lm!3272)
  Co-authored-by: Peter Dykas <[email protected]>
- Implement dist-ckpt content versioning (See merge request ADLR/megatron-lm!3420)
  Co-authored-by: Mcore Bot <[email protected]>
- fix (ckpt): Fix `_extra_state` for TE 2.5 (See merge request ADLR/megatron-lm!3451)
  Co-authored-by: oliver könig <[email protected]>
- Add Hybrid Shard Data-Parallel Support for Custom-FSDP (See merge request ADLR/megatron-lm!3081)
  Co-authored-by: jianbinc <[email protected]>
- Revert `fork` to `spawn` based on stability issues in checkpointing (See merge request ADLR/megatron-lm!3450)
- Add kitchen extension with per-layer configurable quantization configuration (See merge request ADLR/megatron-lm!3301)
  Co-authored-by: Simon Layton <[email protected]>
- Add deprecation warning for legacy inference (See merge request ADLR/megatron-lm!3474)
- Change naming of original_max_position_embeddings to avoid conflicts (See merge request ADLR/megatron-lm!3181)
- Make cudagraph replay check more descriptive when it fails arg checks (See merge request ADLR/megatron-lm!3472)
No description provided.