Problem with standard regex

Kornel Benko kornel at lyx.org
Tue Jan 5 09:56:13 UTC 2021


Am Mon, 4 Jan 2021 23:37:04 +0200
schrieb Yuriy Skalko <yuriy.skalko at gmail.com>:

> > Thanks. I am not so sure that 'from_ascii()' is the better choice comparing to
> > 'from_utf8()' though.  
> 
> Maybe it will be better, but I cannot remember seeing exceptions with 
> `what`-messages not in plain English. Feel free to update.
> 
> 
> > BTW, I tested the regexes with Russian documents, and found an error in 
> > Additional.lyx
> > Correction attached.  
> 
> Really, that was missed out. Please commit it.
> 

Done at c6bc5f0c

> > Is the regex handling (with enabled format) now to your liking?
> > 
> >         Kornel  
> 
> I've tested the regexes (with and without format). Now there are no 
> problems with Cyrillic in any practical regexes I can think of. Thanks 
> for advsearch that is really usable now, Kornel!
> 

Tried some exotic ways?
1.) Search for 3 or more consecutive identical chars
	'(\S)\1\1+'
2.) Repeated words
	'\b(\w+)\s+\1\b'
  # You will find 'действительно действительно', 'после после' in Additional.lyx
3.) Different languages (e.g.Latin) (Wrap the whole expression into Latin env
	'.+'
etc

> Yuriy

Happy it works.

	Kornel
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 488 bytes
Desc: Digitale Signatur von OpenPGP
URL: <http://lists.lyx.org/pipermail/lyx-devel/attachments/20210105/7afd80c1/attachment.asc>


More information about the lyx-devel mailing list