In the word language model example, it seems that during evaluation the code starts computing the loss from the second word, thereby skipping the loss of the first word.
examples/word_language_model/main.py, line 136 in 537f697:

```python
data, targets = get_batch(data_source, i)
```
examples/word_language_model/main.py, lines 121 to 125 in 537f697:

```python
def get_batch(source, i):
    seq_len = min(args.bptt, len(source) - 1 - i)
    data = source[i:i+seq_len]
    target = source[i+1:i+1+seq_len].view(-1)
    return data, target
```
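To make the concern concrete, here is a minimal, self-contained sketch (not the repo's code verbatim; `bptt` stands in for `args.bptt`, and the toy tensor is made up) showing that `target` always starts at index `i+1`, so the first row of the batchified data never appears as a target:

```python
import torch

bptt = 3  # stands in for args.bptt

def get_batch(source, i):
    seq_len = min(bptt, len(source) - 1 - i)
    data = source[i:i + seq_len]
    target = source[i + 1:i + 1 + seq_len].view(-1)
    return data, target

# toy "batchified" data: 7 time steps, batch size 2
source = torch.arange(14).view(7, 2)

targets_seen = []
for i in range(0, source.size(0) - 1, bptt):
    _, target = get_batch(source, i)
    targets_seen.append(target)

print(torch.cat(targets_seen).tolist())
# [2, 3, ..., 13]: every token except source[0] = [0, 1] shows up as a
# target, so the loss of the first word in each column is never computed.
```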
Furthermore, the evaluation data is split into 10 parallel streams (eval_batch_size = 10), hence the losses of 10 words are skipped in total.
Am I right, or did I miss something?
examples/word_language_model/main.py, lines 85 to 88 in 537f697:

```python
eval_batch_size = 10
train_data = batchify(corpus.train, args.batch_size)
val_data = batchify(corpus.valid, eval_batch_size)
test_data = batchify(corpus.test, eval_batch_size)
```
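For completeness, here is a rough sketch of what `batchify` does (simplified from the example; the exact code at 537f697 may differ slightly), showing that with `eval_batch_size = 10` the corpus is laid out as 10 parallel columns, each contributing one column-initial word whose loss is never evaluated by `get_batch` above:

```python
import torch

def batchify(data, bsz):
    nbatch = data.size(0) // bsz             # full time steps per column
    data = data.narrow(0, 0, nbatch * bsz)   # drop tokens that don't fit
    return data.view(bsz, -1).t().contiguous()  # shape: (nbatch, bsz)

corpus = torch.arange(26)        # toy "corpus" of 26 token ids
val_data = batchify(corpus, 10)  # eval_batch_size = 10 -> 10 columns
print(val_data.shape)            # torch.Size([2, 10])
print(val_data[0])               # the 10 column-initial tokens; get_batch
                                 # never produces them as targets
```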