Commit Graph

48 Commits (e822e78afd731aaa0e6a608c5648cde7bc40eec5)

Author SHA1 Message Date
John Benediktsson 26abdf4a23 regexp: try again to fix the issue with backslashes.
the new simple rule is inside R/ syntax all backslashes should be
escaped to allow easy regexp literal tokenization:

R/ [\/]/

but in the constructor <regexp> they shouldn't be:

"[/]" <regexp>

When the regexp is prettyprinted we make sure to visually escape the
backslashes in the raw regexp.
2019-03-21 13:29:15 -07:00
Doug Coleman 15b13870ae peg.ebnf: Clean up EBNF: syntax.
The syntax for an EBNF parser is now ``EBNF: identifier string``. The
inline syntax went from [EBNF ...  EBNF] to EBNF[[ ]], EBNF[=[ ]=], etc.

<EBNF looked for a main word, but this functionality is just EBNF[[ ]]
now.
2017-08-05 22:23:57 -05:00
John Benediktsson 498285d7dd unicode: make this the API for all unicode things. 2016-03-30 23:29:48 -07:00
John Benediktsson ceb75057da change ERROR: words from throw-foo back to foo. 2015-08-13 16:13:05 -07:00
Doug Coleman b6be8685c3 basis: ERROR: changes. 2015-08-13 03:20:39 -07:00
Doug Coleman 352e5de16a core, basis, extra: Remove DOS line endings from files.
Remove whitespace from end of lines.
Add a newline to the end of each file.
2015-06-29 17:25:40 -07:00
John Benediktsson b366a06c41 use reject instead of [ ... not ] filter. 2015-05-12 18:50:34 -07:00
John Benediktsson 8089bbaeab regexp.parser: fix for recent unicode.scripts change. 2014-05-19 20:51:01 -07:00
Joe Groff df19e2ee01 regexp.parser: fix failing test 2011-12-06 21:11:58 -08:00
Joe Groff 943596575a use radix literals 2011-11-23 19:03:40 -08:00
Daniel Ehrenberg 88e88207ff Fixing bug in regexp \P 2010-03-23 00:30:49 -04:00
Daniel Ehrenberg da57436180 Finishing eliminating prune as a synonym of members 2010-02-27 14:52:24 -05:00
Slava Pestov 3353a777f7 Fixing some unit test failures 2009-04-22 07:05:00 -05:00
Doug Coleman dba4c0d589 fixing compiler errors in basis 2009-04-17 12:46:04 -05:00
Daniel Ehrenberg 541ce3aa68 Fixing regexp parser bug: now R/ foo/5 makes an error 2009-04-12 13:57:49 -05:00
Daniel Ehrenberg d6f9947bff Regexp character class intersection and difference syntax 2009-03-21 02:53:36 -05:00
Daniel Ehrenberg 9760f54857 Regexp supports Unicode properties (categories and script) 2009-03-20 19:03:26 -05:00
Daniel Ehrenberg ba9938c30f Reorganizing things in regexp, mostly 2009-03-18 16:09:45 -05:00
Daniel Ehrenberg 54194d269c Some reorganizing in Unicode; regexp class changes 2009-03-17 19:39:04 -05:00
Daniel Ehrenberg ababfe80ef More regexp docs; unix line ending support 2009-03-16 17:53:38 -05:00
Daniel Ehrenberg 8b286cea4c Adding word breaks to regexp 2009-03-11 15:51:54 -05:00
Daniel Ehrenberg 762485c2ca Fixing xmode use lines; adding fake reluctant ?*+ to make XMode work (they're actually greedy) 2009-03-08 18:50:41 -05:00
Daniel Ehrenberg 42ff154ead More regexp changes 2009-03-07 16:31:46 -06:00
Daniel Ehrenberg a487ed0f32 Lookaround and anchors work! (still need to fix some bugs) 2009-03-05 16:34:04 -06:00
Daniel Ehrenberg eb231df4e7 Beginnings of lookahead and lookbehind 2009-03-04 00:36:03 -06:00
Daniel Ehrenberg c708bfcbca Various regexp changes, including the addition of regexp combinators 2009-02-25 12:22:12 -06:00
Daniel Ehrenberg 041be23cdc trivial change in regexp 2009-02-20 18:45:24 -06:00
Daniel Ehrenberg e41cdf5e8f Various unfinshed regexp changes 2009-02-20 17:54:48 -06:00
Daniel Ehrenberg f535b66aed Negation almost complete in regexp 2009-02-19 18:28:54 -06:00
Daniel Ehrenberg 9565b59928 Regexp negation (partial) and cleanup of regexp.nfa 2009-02-19 16:48:46 -06:00
Daniel Ehrenberg b8845cb87e Almost done with regexp cleanup 2009-02-18 12:27:07 -06:00
Daniel Ehrenberg 105ef28433 Rewriting regexp parser 2009-02-16 20:23:00 -06:00
Slava Pestov f34c14a0f5 Remove some usages of tuck 2009-01-23 18:20:47 -06:00
Daniel Ehrenberg ad53cb8635 Privatizing unicode.case:ch>{lower,upper,title} 2009-01-08 19:07:46 -06:00
Slava Pestov c5160d76e9 Merge qualified, alias, symbols, constants into core 2008-12-17 18:10:01 -06:00
Doug Coleman 384a11ecee move things around. the parser no longer adjusts nodes based on options, instead opting for nfa to handle it (case-insensitive, multiline, dotall, reversed..) 2008-11-24 22:17:47 -06:00
Doug Coleman f8a23c657b a bit of refactoring, preparing to take options out of the parsing stage 2008-11-24 12:59:29 -06:00
Doug Coleman db3c21663e better literal syntax for regexps. support ^ and $ (not in multiline mode yet) but not yet in multiline mode. support for \A and \Z \z is next. removed some crud from the parser, and added more commented out unit tests... 2008-11-24 00:18:27 -06:00
Doug Coleman afc97627f9 remove >r r> from regexp 2008-11-22 23:01:24 -06:00
Doug Coleman ba680fae6f handle dotall mode -- . matches newlines when it's on. dotall mode is off by default. rearrange unit tests a bit 2008-11-22 21:04:09 -06:00
Doug Coleman e110bac51a better parsing of constants 2008-11-22 16:50:40 -06:00
Doug Coleman 239bfeafda fix "\\/" <regexp> 2008-11-18 16:06:56 -06:00
Slava Pestov 5c51d9fd2c Get regexp words to infer 2008-11-17 21:42:59 -06:00
Doug Coleman 46aa56730b better parsing for anchors 2008-11-06 16:53:00 -06:00
Doug Coleman 80a9147691 working on lookbehind 2008-09-22 13:37:27 -05:00
Doug Coleman 56fbeb25ff fix options before non-capturing groups 2008-09-22 11:45:36 -05:00
Doug Coleman da35e13153 fix the parser for groups 2008-09-22 10:48:01 -05:00
Slava Pestov fe071a8908 Move regexp to basis 2008-09-22 02:37:40 -05:00