aboutsummaryrefslogtreecommitdiffstats
path: root/src/tokenizer.rs (unfollow)
Commit message (Collapse)AuthorFilesLines
2022-07-25Refactor to remove need for cloning codesLibravatar Titus Wormer1-33/+21
2022-07-25Improve performance w/ a single feed loopLibravatar Titus Wormer1-97/+90
2022-07-22Refactor to remove unneeded `Option`Libravatar Titus Wormer1-14/+18
2022-07-22Refactor to use a single shared edit mapLibravatar Titus Wormer1-16/+7
2022-07-22Refactor to reserve some more vecsLibravatar Titus Wormer1-5/+9
2022-07-22Refactor to pass more references aroundLibravatar Titus Wormer1-26/+39
2022-07-22Refactor to remove unneeded tuples in every statesLibravatar Titus Wormer1-60/+50
2022-07-22Refactor to pass ints instead of vecs aroundLibravatar Titus Wormer1-74/+47
2022-07-21Refactor to move `index` field to `point`Libravatar Titus Wormer1-19/+10
2022-07-21Refactor to move some event fields to `link`Libravatar Titus Wormer1-11/+24
2022-07-20Refactor to share edit mapLibravatar Titus Wormer1-2/+17
2022-07-20Refactor to use less vecs for eventsLibravatar Titus Wormer1-7/+5
2022-07-19Remove a couple of clonesLibravatar Titus Wormer1-3/+2
2022-07-19Refactor to remove cloning in `edit_map`Libravatar Titus Wormer1-0/+1
2022-07-19Use `edit_map` in `subtokenize`Libravatar Titus Wormer1-2/+0
2022-07-19Remove an unneeded `HashMap`Libravatar Titus Wormer1-15/+43
2022-07-18Fix token that should be voidLibravatar Titus Wormer1-1/+10
2022-07-15Add docs and refactor listLibravatar Titus Wormer1-1/+1
2022-07-15Add docs and refactor document content typeLibravatar Titus Wormer1-1/+10
2022-07-15Add missing docs to tokenizerLibravatar Titus Wormer1-14/+16
2022-07-15Fix annoying bug around virtual spaces in containersLibravatar Titus Wormer1-18/+22
2022-07-14Add support for container stateLibravatar Titus Wormer1-0/+10
* Fix to parse list item continuation based on how big the initial list item prefix was * Fix list items that start with blank lines
2022-07-11Add support for lazy linesLibravatar Titus Wormer1-1/+0
2022-07-11Fix block quote bugsLibravatar Titus Wormer1-5/+6
2022-07-08Fix positional info on line ending after shiftLibravatar Titus Wormer1-4/+17
2022-07-08Fix closing of flow when exiting containersLibravatar Titus Wormer1-14/+30
2022-07-08Add support for concrete constructsLibravatar Titus Wormer1-0/+4
2022-07-07Refactor to move token types to `token`Libravatar Titus Wormer1-1774/+6
2022-07-07Fix some small to dosLibravatar Titus Wormer1-1/+2
2022-07-07Add docs for block quote tokensLibravatar Titus Wormer1-1/+76
2022-07-07Fix order of tokensLibravatar Titus Wormer1-77/+77
2022-07-07Add basic support for block quotesLibravatar Titus Wormer1-12/+26
2022-07-07Add support for `Flow` content typeLibravatar Titus Wormer1-0/+2
2022-07-07Add `go_until` to tokenizerLibravatar Titus Wormer1-7/+37
2022-07-05Refactor to do some to dosLibravatar Titus Wormer1-18/+130
2022-07-05Refactor code styleLibravatar Titus Wormer1-78/+0
2022-07-04Add support for unicode punctuationLibravatar Titus Wormer1-1/+1
2022-07-04Update list of todosLibravatar Titus Wormer1-2/+0
2022-07-04Add support for attention (emphasis, strong)Libravatar Titus Wormer1-0/+9
2022-07-01Make paragraphs really fastLibravatar Titus Wormer1-0/+3
The approach that `micromark-js` takes is as follows: to parse a paragraph, check whether each line starts with something else. If it does, exit, otherwise continue. That is slow, because our actual flow parser does similar things: the work was being done twice. To fix this, this commit introduces parsing each line of a paragraph separately. And finally, when done with flow, combining adjacent paragraphs. This same mechanism is reused for setext headings. Additionally, this commit adds support for interrupting things (or not). E.g., HTML (flow, complete) cannot interrupt paragraphs. Definitions cannot interrupt paragraphs, and connect be interrupted either, but they can follow each other.
2022-06-30Add support for trimming whitespace around string, textLibravatar Titus Wormer1-1/+8
This commit introduces trimming initial and final whitespace around the whole string or text, or around line endings inside that string or text. * Add `register_resolver_before`, to run resolvers earlier than others, used for labels * Add resolver to merge `data` events, which are the most frequent token that occurs, and can happen adjacently. In `micromark-js` this sped up parsing a lot * Fix a bug where a virtual space was not seen as an okay event * Refactor to enable all turned off whitespace tests
2022-06-30Add docs on resolver, clean feedLibravatar Titus Wormer1-77/+97
2022-06-30Refactor to reorder token typesLibravatar Titus Wormer1-303/+303
2022-06-30Add docs to image, link, and other media tokensLibravatar Titus Wormer1-16/+434
2022-06-29Refactor to externalize handlers of compilerLibravatar Titus Wormer1-1/+1
2022-06-28Fix jumps in `edit_map`Libravatar Titus Wormer1-33/+16
* Use resolve more often (e.g., heading (atx, setext)) * Fix to link whole phrasing (e.g., one big chunk of text in heading (atx, setext), titles, labels) * Replace `ChunkText`, `ChunkString`, with `event.content_type: Option<ContentType>` * Refactor to externalize `edit_map` from `label`
2022-06-24Add link, images (resource)Libravatar Titus Wormer1-5/+106
This is still some messy code that needs cleaning up, but it adds support for links and images, of the resource kind (`[a](b)`). References (`[a][b]`) are parsed and will soon be supported, but need matching. * Fix bug to pad percent-encoded bytes when normalizing urls * Fix bug with escapes counting as balancing in destination * Add `space_or_tab_one_line_ending`, to parse whitespace including up to one line ending (but not a blank line) * Add `ParserState` to share codes, definitions, etc
2022-06-22Refactor some unneeded assignmentsLibravatar Titus Wormer1-3/+1
2022-06-22Add `attempt_opt` to tokenizerLibravatar Titus Wormer1-35/+30
2022-06-22Rename `Whitespace` token to `SpaceOrTab`Libravatar Titus Wormer1-15/+15