Problem with standard regex
Kornel Benko
kornel at lyx.org
Tue Jan 5 09:56:13 UTC 2021
Am Mon, 4 Jan 2021 23:37:04 +0200
schrieb Yuriy Skalko <yuriy.skalko at gmail.com>:
> > Thanks. I am not so sure that 'from_ascii()' is the better choice comparing to
> > 'from_utf8()' though.
>
> Maybe it will be better, but I cannot remember seeing exceptions with
> `what`-messages not in plain English. Feel free to update.
>
>
> > BTW, I tested the regexes with Russian documents, and found an error in
> > Additional.lyx
> > Correction attached.
>
> Really, that was missed out. Please commit it.
>
Done at c6bc5f0c
> > Is the regex handling (with enabled format) now to your liking?
> >
> > Kornel
>
> I've tested the regexes (with and without format). Now there are no
> problems with Cyrillic in any practical regexes I can think of. Thanks
> for advsearch that is really usable now, Kornel!
>
Tried some exotic ways?
1.) Search for 3 or more consecutive identical chars
'(\S)\1\1+'
2.) Repeated words
'\b(\w+)\s+\1\b'
# You will find 'действительно действительно', 'после после' in Additional.lyx
3.) Different languages (e.g.Latin) (Wrap the whole expression into Latin env
'.+'
etc
> Yuriy
Happy it works.
Kornel
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 488 bytes
Desc: Digitale Signatur von OpenPGP
URL: <http://lists.lyx.org/pipermail/lyx-devel/attachments/20210105/7afd80c1/attachment.asc>
More information about the lyx-devel
mailing list