Fix middle-word-em interfering with strongs (#637) #639

Crozzers · 2025-09-28T10:53:18Z

This PR fixes #637.

The issue came down to the fact that the extra would run before the italics and bold stage. It would attempt to ignore the <strong> and then process strict <em>s and middle-word-ems. The problem is that the syntax for strongs and ems are very similar, and trying to craft a regex that can differentiate is tough.

The way this extra worked previously was to process valid <em> syntax and then hash anything that looks like <em> syntax but isn't quite valid.

The new approach is simply to find any _ or * character in the middle of a word and hash it. This way, the regular italics and bold stage don't have to worry about them and we can keep the regexes simple.

The hash we use is basically the same that you find in self._escape_table except we prefix the extra's name to the input text to prevent interference with escaped/hashed chars from other stages

nicholasserra · 2025-09-29T19:51:26Z

LGTM Thanks!

Crozzers added 2 commits September 28, 2025 11:43

Fix middle-word-em issue trentm#637

f6d8b0e

Update changelog

1acbf8f

nicholasserra merged commit f44849c into trentm:master Sep 29, 2025
15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix middle-word-em interfering with strongs (#637) #639

Fix middle-word-em interfering with strongs (#637) #639

Uh oh!

Crozzers commented Sep 28, 2025

Uh oh!

Uh oh!

nicholasserra commented Sep 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix middle-word-em interfering with strongs (#637) #639

Fix middle-word-em interfering with strongs (#637) #639

Uh oh!

Conversation

Crozzers commented Sep 28, 2025

Uh oh!

Uh oh!

nicholasserra commented Sep 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants