Talk:Regular expression
This is the talk page for discussing improvements to the Regular expression article. This is not a forum for general discussion of the article's subject.
This level-5 vital article is rated B-class on Wikipedia's content assessment scale. It is of interest to the following WikiProjects: Computing (Top-importance), Software (High-importance), Computer science (Top-importance).
Text and/or other creative content from this version of Regular expression examples was copied or moved into Regular expression with this edit on 22:28, 9 January 2012. The former page's history now serves to provide attribution for that content in the latter page, and it must not be deleted as long as the latter page exists.
This page has archives. Sections older than 90 days may be automatically archived by Lowercase sigmabot III when more than 4 sections are present.
Regular Expression Crossword
Hi, I hope it is appropriate to add this comment. There is a regular expression puzzle website called regexcrossword.com. I have found it very useful for practising the finer points of writing good regular expressions. Enjoy. Robin48gx (talk) 09:24, 16 September 2024 (UTC)
Replace first thumbnail image with a non-religious image and with a non-ECMAScript expression
For accessibility and WP:NPOV, perhaps "The quick brown fox jumps over the lazy dog" with the pattern [aeiou]+. Looks like the current image was taken from https://regexr.com/.
If I understand correctly, the current /h[aeiou]+/g in the thumbnail is an ECMAScript convention¹, but the thumbnail doesn't mention this; hence, in combination with the above, I would also drop the prefix / and the suffix /g.
¹ https://262.ecma-international.org/5.1/#sec-7.8.5 31.20.106.40 (talk) 11:47, 10 October 2023 (UTC)
- Not just ECMAScript uses the slash regex syntax. Perl, which played a key role in the growth of more complex features, uses this /expression/flag thing; that in turn evolved from ed's /expression/ syntax. As for religiousness, I don't care much about it. I am concerned that the new [aeiou]+ pattern is too simple, however. (Try a longer, neutral text: The Universal Declaration of Human Rights looks good against h[aeiou]+.) Artoria2e5 🌉 12:20, 8 February 2024 (UTC)
- I created the current image in 2022 without giving it much thought, mostly just intending to give a flavour of the complexity of regular expressions. The previous illustration was File:The river effect in justified text.jpg which looked more like a regular text search for a double space. It also had the complex example code of (?<=\.) {2,}(?=[A-Z]), I assume because the image came first (it's from the sentence spacing article) and the regexp was written to fit.
- I think the lead image example just needs to be simple enough that somebody learning about regular expressions for the first time would quickly understand the concept and be able to more or less see what the regexp search term meant - and also complex enough that the same reader could, in the highlighted output, see the power it had above a regular text search.
- A straight [aeiou]+ does seem too simple, as in practice (assuming that we're keeping things simple and only using a single highlight colour) the output would be the same as for [aeiou]. Belbury (talk) 17:18, 26 February 2024 (UTC)
- I'd prefer a non-religious text, too. What about the US declaration of independence (since the EN wiki server resides in the US)? Or some famous text of (e.g.) Shakespeare? The search pattern /h[aeiou]+/g seems fine for any of these. - Jochen Burghardt (talk) 17:12, 27 February 2024 (UTC)
- Done: Since nobody objected, I've implemented my suggestion, using the start of Antony's burial speech in Julius Caesar by Shakespeare. I changed the pattern to /r[aeiouy]+/ to get a more interesting image; if considering "y" as a vowel is a problem, let me know; I can remove it from the pattern. - Jochen Burghardt (talk) 19:12, 12 March 2024 (UTC)
- @Jochen Burghardt: Good move, thanks. Some readers may find the inclusion of "y" a bit odd. Amending the description might be better than removing the letter, though, so perhaps "Shaded text shows the match results of the regular expression pattern /r[aeiouy]+/g which finds all occurrences of the letter r followed by one or more vowels or the letter y." (That also takes care of MOS:COLOR.) Bazza 7 (talk) 13:35, 13 March 2024 (UTC)
- Good call, thanks for taking the time to find a quote. Although there is a small issue here in the fact that Romans isn't highlighted in the example. Not sure if it would be better to update the image and include an /i option in the caption, or update the caption to "a lower case r followed by one or more lower-case ...". Belbury (talk) 13:50, 13 March 2024 (UTC)
- Thanks for the corrections. I now omitted the "y" in the picture, in order to keep the informal description short (just "vowel"). Moreover, I changed "letter r" to "lower case r" in the description, in order not to presuppose too much knowledge about search options like /g (which seems unavoidable) and /i. An alternative could be /[Rr][aeiou]/g, which is unnecessarily complicated, however (exemplifying [] just once is sufficient). - Jochen Burghardt (talk) 14:47, 14 March 2024 (UTC)
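The matching behaviour discussed in this thread can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the sample text is assumed to be the opening line of Antony's speech (the text in the actual image may differ), Python's re module stands in for the ECMAScript syntax, re.findall plays the role of the /g flag, re.IGNORECASE plays the role of /i, and the helper covered is hypothetical.

```python
import re

# Assumed sample text: the opening line of Antony's speech in Julius Caesar.
TEXT = "Friends, Romans, countrymen, lend me your ears"


def covered(pattern, text, flags=0):
    """Return the character positions a single-colour highlight would shade."""
    return {i for m in re.finditer(pattern, text, flags) for i in range(*m.span())}


# Lower-case r followed by one or more lower-case vowels ("Romans" is skipped).
plain = re.findall(r"r[aeiou]+", TEXT)

# Same pattern, case-insensitive (the /i option mentioned above).
ignore_case = re.findall(r"r[aeiou]+", TEXT, re.IGNORECASE)

# Treating "y" as a vowel also picks up the "ry" in "countrymen".
with_y = re.findall(r"r[aeiouy]+", TEXT)

# With one highlight colour, [aeiou]+ and [aeiou] shade the same characters.
same_shading = covered(r"[aeiou]+", TEXT) == covered(r"[aeiou]", TEXT)

print(plain, ignore_case, with_y, same_shading)
```

On this assumed text the case-sensitive pattern misses "Romans", which is the issue raised above, and the two vowel patterns shade identical characters even though they return different match lists.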
Perhaps an error
The text on the main page says this: "Every regular expression can be written solely in terms of the Kleene star and set unions over finite words." I think concatenation is also needed; if you have only Kleene star and unions over finite sets of words, you cannot make {1} conc {0}* (sets of words starting with 1 followed by arbitrarily many zeroes). 137.132.217.132 (talk) 09:07, 12 March 2024 (UTC)
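The language in question can be illustrated with a short Python sketch. It uses Python's re syntax as a stand-in for formal regular expressions (an assumption; the article's statement is about the formal notation), and the helper names words and language are hypothetical. It compares 10*, which needs concatenation, with one candidate built only from the Kleene star over a finite set of words, (1|10|100)*. This is an illustration of one candidate, not a proof.

```python
import re
from itertools import product


def words(max_len):
    """Yield all binary words of length 0..max_len."""
    for n in range(max_len + 1):
        for letters in product("01", repeat=n):
            yield "".join(letters)


def language(pattern, max_len=4):
    """Return the words up to max_len that the pattern matches in full."""
    return {w for w in words(max_len) if re.fullmatch(pattern, w)}


# {1} conc {0}*: a single 1 followed by arbitrarily many zeroes.
with_concat = language(r"10*")

# Kleene star over the finite set of words {1, 10, 100}, no further concatenation.
star_only = language(r"(1|10|100)*")

print(sorted(with_concat, key=len))
print("11" in star_only, "1000" in star_only)
```

The starred candidate accepts "11" (and the empty word), which are outside the target language, while missing "1000"; any starred set that contains a word such as 10 also contains its repetitions such as 1010.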
"look-ahead" or "lookahead"
Both "look-ahead" and "lookahead" are used in the Regular_expression#Assertions section of the page. I imagine "look-ahead" is incorrect, but I will leave it up to someone else to either clarify that both are correct or remove the incorrect one. Llamasaylol (talk) 17:32, 29 May 2024 (UTC)
- @Llamasaylol: [1] uses lookahead and lookbehind. I've made those two consistent in the article. Bazza 7 (talk) 23:27, 18 January 2025 (UTC)
With the JavaScript regex engine you get lookaheads (negative and positive). But with the Python regex engine you get lookbehinds as well. Robin48gx (talk) 09:25, 16 September 2024 (UTC)
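For reference, the four assertion kinds mentioned in this section look like this in Python's re module. This is a minimal sketch with a made-up sample string; it shows Python only, and support in other engines varies by engine and version.

```python
import re

# Made-up sample string for illustration.
text = "price: $42, cost: 17 EUR"

# Positive lookahead: digits that are followed by " EUR".
before_eur = re.findall(r"\d+(?= EUR)", text)

# Negative lookahead: digit runs not followed by " EUR" (or by another digit).
not_eur = re.findall(r"\d+(?!\d| EUR)", text)

# Positive lookbehind: digits that are preceded by "$".
after_dollar = re.findall(r"(?<=\$)\d+", text)

# Negative lookbehind: digit runs not preceded by "$" (or by another digit).
not_dollar = re.findall(r"(?<![\d$])\d+", text)

print(before_eur, not_eur, after_dollar, not_dollar)
```

The extra digit alternatives in the negative forms stop the engine from backtracking into a shorter run of digits and reporting a partial number.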