close #19 by freelw · Pull Request #23 · freelw/cpp-transformer

freelw · 2025-06-08T12:11:01Z

lm support

Copilot

Pull Request Overview

Adds language model (LM) support with a new LM target, decoder modules, dataloaders, and updates build & documentation.

Introduce LMDecoderBlock and LMDecoder classes under module/language_model/
Add LMDataLoader, Vocab reuse, and update makefile, README, and logs for LM
Wire up new lm.cpp entry point and VSCode settings for debugging

Reviewed Changes

Copilot reviewed 33 out of 33 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
module/translation/seq2seq.h	Redirect encoder/decoder includes into `translation/` subdir
module/language_model/*.{h,cpp}	New LM decoder block and decoder implementation
dataloaders/language_model/*.{h,cpp}	New LM data loader and vocab integration
dataloaders/vocab.{h,cpp}	Shared `Vocab` class for translation & LM
lm.cpp	New main entry for training/inference of language model
makefile	Added `lm` target and include paths for new modules
README.md, log.md	Documentation and logs updated with LM usage examples
.vscode/settings.json, .vscode/launch.json	Debug settings for LM

Comments suppressed due to low confidence (4)

dataloaders/language_model/lm_dataloader.h:1

The include guard has a typo (LM_DADALOADER_H); it should match the filename and read LM_DATALOADER_H.

#ifndef LM_DADALOADER_H

module/language_model/lm_decoder_block.h:1

[nitpick] No unit tests are provided for LMDecoderBlock; consider adding tests for its forward and get_parameters methods to cover key behaviors.

#ifndef LM_DECODER_BLOCK_H

module/language_model/lm_decoder_block.cpp:1

The header include uses a bare filename; to avoid ambiguity and ensure correct header resolution, consider using the full relative path: #include "module/language_model/lm_decoder_block.h".

#include "lm_decoder_block.h"

module/language_model/lm_decoder.cpp:1

Similar to the block file, include the decoder header via its subdirectory path: #include "module/language_model/lm_decoder.h" for consistency and to prevent collisions.

#include "lm_decoder.h"

Copilot · 2025-06-08T12:12:03Z

lm.cpp

+            break;
+        default:
+            std::cerr << "Usage: " << argv[0]
+                << " -f <corpus> -c <checpoint> -e <epochs>" << std::endl;


Fix the typo in the usage message: -c <checpoint> should be -c <checkpoint>.

Suggested change

<< " -f <corpus> -c <checpoint> -e <epochs>" << std::endl;

<< " -f <corpus> -c <checkpoint> -e <epochs>" << std::endl;

freelw added 30 commits June 8, 2025 12:26

update translation

22384b0

rm

e6e88d5

revert name

ab10863

revert name

244bfaa

update

b5169e2

lm init

1bb4bc9

update lm

3be6772

update

4b18651

update

d330593

update

feb3051

update

5db0069

update

4b57ba5

update

bb75b02

update

11ecb29

update

9a7b9c0

update

889069a

update

0adae76

update

b6bc604

update

0aa4cb1

update

0d9020a

update

94084d3

lm bug wip

3452038

dbg

843589a

lm loss dec

05dd126

lm predict wip

2787bb1

wip

abcefa5

lm 256 predict

89f89f0

wip

e030a8b

wip

080007c

wip

81b6a91

freelw added 3 commits June 8, 2025 19:56

lm works

311bbfa

update

806620c

update

36d4361

freelw requested a review from Copilot June 8, 2025 12:11

freelw merged commit f12e3f0 into main Jun 8, 2025
1 check passed

Copilot AI reviewed Jun 8, 2025

View reviewed changes

freelw mentioned this pull request Jun 8, 2025

Update Makefile for macOS Clang + OpenMP + C++11 #2

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

close #19#23

close #19#23
freelw merged 33 commits intomainfrom
wangli_dev_20250608_1

freelw commented Jun 8, 2025

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Jun 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	<< " -f <corpus> -c <checpoint> -e <epochs>" << std::endl;
	<< " -f <corpus> -c <checkpoint> -e <epochs>" << std::endl;

Conversation

freelw commented Jun 8, 2025

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Jun 8, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants