Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								edba5f2bbe
								
							
								
							
						 | 
						
							
							
								
								Fix bug in html.parser.analyzer
							
							
							
							
							
						 | 
						
							2010-10-25 12:50:43 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								1e9afc927f
								
							
								
							
						 | 
						
							
							
								
								Simplify combinator with joe's suggestion, unit test
							
							
							
							
							
						 | 
						
							2010-08-29 15:24:55 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								bd3fccfd4a
								
							
								
							
						 | 
						
							
							
								
								Clean up html.parser.analyzer, apply blei's fix for find-between*
							
							
							
							
							
						 | 
						
							2010-08-29 13:22:11 -05:00 | 
						
					
				
					
						
							
							
								 
								Philipp Brüschweiler
							
						 | 
						
							
							
							
							
								
							
								99a37fd571
								
							
								
							
						 | 
						
							
							
								
								html.parser.analyzer: find-between* didn't always return a sequence, which in turn broke find-hrefs (reported by randy7)
							
							
							
							
							
						 | 
						
							2010-08-13 22:53:37 +02:00 | 
						
					
				
					
						
							
							
								 
								Daniel Ehrenberg
							
						 | 
						
							
							
							
							
								
							
								da57436180
								
							
								
							
						 | 
						
							
							
								
								Finishing eliminating prune as a synonym of members
							
							
							
							
							
						 | 
						
							2010-02-27 14:52:24 -05:00 | 
						
					
				
					
						
							
							
								 
								Slava Pestov
							
						 | 
						
							
							
							
							
								
							
								df4fb4a3ee
								
							
								
							
						 | 
						
							
							
								
								Removing integers-as-sequences
							
							
							
							
							
						 | 
						
							2010-01-15 07:15:33 +13:00 | 
						
					
				
					
						
							
							
								 
								Slava Pestov
							
						 | 
						
							
							
							
							
								
							
								0612bc6177
								
							
								
							
						 | 
						
							
							
								
								Factor source files should not be executable
							
							
							
							
							
						 | 
						
							2009-11-21 17:50:43 -06:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								3e28be6568
								
							
								
							
						 | 
						
							
							
								
								move sequence-parser to sequences.parser
							
							
							
							
							
						 | 
						
							2009-11-05 17:12:10 -06:00 | 
						
					
				
					
						
							
							
								 
								Keith Lazuka
							
						 | 
						
							
							
							
							
								
							
								405e5d015b
								
							
								
							
						 | 
						
							
							
								
								docs: change $subsection to $subsections
							
							
							
							
							
						 | 
						
							2009-10-02 12:15:48 -04:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								725a07b62d
								
							
								
							
						 | 
						
							
							
								
								add find-images word
							
							
							
							
							
						 | 
						
							2009-08-26 14:28:06 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								3f3d57032b
								
							
								
							
						 | 
						
							
							
								
								Delete empty unit tests files, remove 1- and 1+, reorder IN: lines in a lot of places, minor refactoring
							
							
							
							
							
						 | 
						
							2009-08-13 19:21:44 -05:00 | 
						
					
				
					
						
							
							
								 
								Phil Dawes
							
						 | 
						
							
							
							
							
								
							
								c1d08d213f
								
							
								
							
						 | 
						
							
							
								
								Added thead, tfoot to html
							
							
							
							
							
						 | 
						
							2009-06-30 16:26:51 +01:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								407377fc98
								
							
								
							
						 | 
						
							
							
								
								rename peek -> last and update all usages
							
							
							
							
							
						 | 
						
							2009-05-25 16:38:33 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								8f688eb742
								
							
								
							
						 | 
						
							
							
								
								fix comments and dtds in html parser
							
							
							
							
							
						 | 
						
							2009-05-20 15:50:01 -05:00 | 
						
					
				
					
						
							
							
								 
								Slava Pestov
							
						 | 
						
							
							
							
							
								
							
								b62630b5ac
								
							
								
							
						 | 
						
							
							
								
								Updating code in basis and extra for stricter vocab search path semantics
							
							
							
							
							
						 | 
						
							2009-05-14 23:23:06 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								9ac2214b62
								
							
								
							
						 | 
						
							
							
								
								fix html.parser
							
							
							
							
							
						 | 
						
							2009-04-11 12:11:00 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								6583b4d38e
								
							
								
							
						 | 
						
							
							
								
								rename html.parser.state to sequence-parser
							
							
							
							
							
						 | 
						
							2009-04-09 15:28:48 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								8875c2ba26
								
							
								
							
						 | 
						
							
							
								
								return links as URL objects in html vocab
							
							
							
							
							
						 | 
						
							2009-04-03 21:15:52 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								a42b872a23
								
							
								
							
						 | 
						
							
							
								
								fix bug in take-sequence
							
							
							
							
							
						 | 
						
							2009-04-03 12:40:45 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								7c7742cafa
								
							
								
							
						 | 
						
							
							
								
								use unicode instead of ascii again
							
							
							
							
							
						 | 
						
							2009-04-02 01:18:53 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								9bee1fe004
								
							
								
							
						 | 
						
							
							
								
								fix take-rest for out of bounds
							
							
							
							
							
						 | 
						
							2009-04-01 22:39:20 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								7b6260ca8c
								
							
								
							
						 | 
						
							
							
								
								remove tokenize-line
							
							
							
							
							
						 | 
						
							2009-04-01 17:28:36 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								6af6de1aac
								
							
								
							
						 | 
						
							
							
								
								make tokenize-line configurable, fix bug in take-quoted-string
							
							
							
							
							
						 | 
						
							2009-04-01 15:51:39 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								d64e07af8b
								
							
								
							
						 | 
						
							
							
								
								fix bug in state-parser, add take-token
							
							
							
							
							
						 | 
						
							2009-04-01 15:23:05 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								947bcc3d33
								
							
								
							
						 | 
						
							
							
								
								state-parser take-quoted-string rewinds if the string is not found
							
							
							
							
							
						 | 
						
							2009-04-01 14:50:43 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								1e4eebda3a
								
							
								
							
						 | 
						
							
							
								
								refactor state parser some more, add a word to parse escaped strings
							
							
							
							
							
						 | 
						
							2009-04-01 14:36:08 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								d52535b63a
								
							
								
							
						 | 
						
							
							
								
								set non-key/value attributes to themselves
							
							
							
							
							
						 | 
						
							2009-04-01 13:42:38 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								09e4d34ff2
								
							
								
							
						 | 
						
							
							
								
								rename next to advance
							
							
							
							
							
						 | 
						
							2009-04-01 12:52:02 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								28dae46b7d
								
							
								
							
						 | 
						
							
							
								
								make html.parser words private
							
							
							
							
							
						 | 
						
							2009-04-01 12:48:51 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								9ecf8ec3db
								
							
								
							
						 | 
						
							
							
								
								remove dead code
							
							
							
							
							
						 | 
						
							2009-04-01 12:47:17 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								826d9f18c5
								
							
								
							
						 | 
						
							
							
								
								fix parsing of attributes for nofollows
							
							
							
							
							
						 | 
						
							2009-04-01 12:44:38 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								7060a5905f
								
							
								
							
						 | 
						
							
							
								
								add take-sequence to state parser
							
							
							
							
							
						 | 
						
							2009-04-01 12:44:06 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								d82b8ba4eb
								
							
								
							
						 | 
						
							
							
								
								more cleanup
							
							
							
							
							
						 | 
						
							2009-04-01 11:48:44 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								fdb8c9da1a
								
							
								
							
						 | 
						
							
							
								
								cleaning up html.parser
							
							
							
							
							
						 | 
						
							2009-04-01 11:43:30 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								20df429a50
								
							
								
							
						 | 
						
							
							
								
								take-until doesnt pass the element to the quotation anymore
							
							
							
							
							
						 | 
						
							2009-04-01 03:13:38 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								f994654af3
								
							
								
							
						 | 
						
							
							
								
								add take-while to state parser
							
							
							
							
							
						 | 
						
							2009-04-01 02:45:20 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								99c3cd9517
								
							
								
							
						 | 
						
							
							
								
								more renaing get-char to current
							
							
							
							
							
						 | 
						
							2009-04-01 02:33:38 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								aa37871ff9
								
							
								
							
						 | 
						
							
							
								
								rename get-next to peek-next, get-char to current
							
							
							
							
							
						 | 
						
							2009-04-01 02:32:48 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								8e26b19cc0
								
							
								
							
						 | 
						
							
							
								
								state-parser works with sequences, not strings
							
							
							
							
							
							
							
							fix bug with take-until 
							
						 | 
						
							2009-03-31 18:49:41 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								ebddd32677
								
							
								
							
						 | 
						
							
							
								
								remove duplication, refactor html.parser to use new state parser
							
							
							
							
							
						 | 
						
							2009-03-31 16:05:11 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								a07c17598e
								
							
								
							
						 | 
						
							
							
								
								redo state parser to avoid dynamic variables
							
							
							
							
							
						 | 
						
							2009-03-31 16:04:39 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								ccdef35949
								
							
								
							
						 | 
						
							
							
								
								inline everything so stuff compiles
							
							
							
							
							
						 | 
						
							2009-03-22 19:25:55 -05:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								7f0aba9de0
								
							
								
							
						 | 
						
							
							
								
								uncomment unit tests and fix parser.state....
							
							
							
							
							
						 | 
						
							2009-03-15 13:27:36 -05:00 | 
						
					
				
					
						
							
							
								 
								Daniel Ehrenberg
							
						 | 
						
							
							
							
							
								
							
								b206c5a2d1
								
							
								
							
						 | 
						
							
							
								
								Finishing getting rid of html.elements from basis
							
							
							
							
							
						 | 
						
							2009-02-06 10:54:13 -06:00 | 
						
					
				
					
						
							
							
								 
								Daniel Ehrenberg
							
						 | 
						
							
							
							
							
								
							
								984b68d00b
								
							
								
							
						 | 
						
							
							
								
								Fixing HTML dependancy on state-parser
							
							
							
							
							
						 | 
						
							2009-01-21 21:57:44 -06:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								e626431a7e
								
							
								
							
						 | 
						
							
							
								
								Refactor html.parser.utils a bit
							
							
							
							
							
						 | 
						
							2009-01-04 11:38:25 -06:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								0329083970
								
							
								
							
						 | 
						
							
							
								
								remove >r r>
							
							
							
							
							
						 | 
						
							2008-12-14 00:38:37 -06:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								c75777b7a2
								
							
								
							
						 | 
						
							
							
								
								swap ... 3append -> surround in extra
							
							
							
							
							
						 | 
						
							2008-12-06 18:58:05 -06:00 | 
						
					
				
					
						
							
							
								 
								Doug Coleman
							
						 | 
						
							
							
							
							
								
							
								829c379c49
								
							
								
							
						 | 
						
							
							
								
								remove >r r>
							
							
							
							
							
						 | 
						
							2008-11-29 12:18:28 -06:00 | 
						
					
				
					
						
							
							
								 
								Slava Pestov
							
						 | 
						
							
							
							
							
								
							
								e6fbd4f84f
								
							
								
							
						 | 
						
							
							
								
								fix compile errors
							
							
							
							
							
						 | 
						
							2008-11-16 16:59:25 -06:00 |