[BUG] Japanese text is misidentified as URL. #390

mattn · 2023-05-01T16:57:18Z

Describe the bug
Some Japanese text is unexpectedly misidentified as URL.

To Reproduce
Steps to reproduce the behavior:

Open input dialog
Type some Japanese characters contains 。 (\u3002)
Post the note
See the text is misidentified as URL.

Expected behavior
The text should be normal text.

Device (please complete the following information):

Android Version: 0.37.4

This is related on linkedin/URL-Detector. linkedin/URL-Detector#39

URL-Detector handle 。 as dot. This is not a bug because IDN allow to use 。 as dot. However, most of Japanese text are often misidentified.

The text was updated successfully, but these errors were encountered:

afternooncurry · 2023-05-11T05:31:31Z

Ideographic full stop and full width period are not handled as dot in IDNA2008 while IDNA2003 does. A current recommendation is IDNA2008. linkedin/URL-Detector looks implemented based on IDNA2003 about IDN which may causes the issue.

mattn · 2023-06-15T23:38:38Z

@vitorpamplona Many users in the East Asian region have been waiting for this fix for a long time.

vitorpamplona · 2023-06-15T23:41:50Z

We are waiting for the library to be able to support these additional characters. Until that library is fixed, there is not much we can do :(

mattn · 2023-06-15T23:44:43Z

Thanks. FYI, This issue can be reproduced with English speakers.

vitorpamplona · 2023-06-15T23:50:57Z

Interesting. That is a different "bug".

We have a separate procedure to linkify all yyy.xxx texts. It requires the domain separator . and should not affect the \u3002 character. Is that an issue for you? I see it here and there during the week but always just ignore it because it works most of the time.

vitorpamplona · 2023-06-15T23:54:36Z

And now that urls can have any Unicode character, I am not really sure how to solve it. Because the Thanks.<emoji> is a valid URL and could actually exist these days.

mattn · 2023-06-16T00:47:29Z

I don't make sure what is wrong in the code but this is rendered as URL in only amethyst.

mattn · 2023-07-05T05:22:02Z

Is this related on this issue?

mattn · 2023-07-07T05:30:47Z

I confirmed this issue is fixed in #491

Thanks.

ShinoharaTa mentioned this issue Jul 5, 2023

Fixed a bug in Japanese string display #491

Merged

mattn closed this as completed Jul 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Japanese text is misidentified as URL. #390

[BUG] Japanese text is misidentified as URL. #390

mattn commented May 1, 2023 •

edited

Loading

afternooncurry commented May 11, 2023

mattn commented Jun 15, 2023

vitorpamplona commented Jun 15, 2023

mattn commented Jun 15, 2023

vitorpamplona commented Jun 15, 2023 •

edited

Loading

vitorpamplona commented Jun 15, 2023 •

edited

Loading

mattn commented Jun 16, 2023

mattn commented Jul 5, 2023

mattn commented Jul 7, 2023

[BUG] Japanese text is misidentified as URL. #390

[BUG] Japanese text is misidentified as URL. #390

Comments

mattn commented May 1, 2023 • edited Loading

afternooncurry commented May 11, 2023

mattn commented Jun 15, 2023

vitorpamplona commented Jun 15, 2023

mattn commented Jun 15, 2023

vitorpamplona commented Jun 15, 2023 • edited Loading

vitorpamplona commented Jun 15, 2023 • edited Loading

mattn commented Jun 16, 2023

mattn commented Jul 5, 2023

mattn commented Jul 7, 2023

mattn commented May 1, 2023 •

edited

Loading

vitorpamplona commented Jun 15, 2023 •

edited

Loading

vitorpamplona commented Jun 15, 2023 •

edited

Loading