Commit message (Collapse) | Author | Files | Lines | ||
---|---|---|---|---|---|
2022-08-11 | Refactor internal docs, code style of tokenizer | Titus Wormer | 1 | -7/+3 | |
2022-08-10 | Add `State::Retry` | Titus Wormer | 1 | -2/+2 | |
2022-08-10 | Rename `State::Fn` to `State::Next` | Titus Wormer | 1 | -21/+21 | |
2022-08-09 | Add support for passing `ok`, `nok` as separate states to attempts | Titus Wormer | 1 | -21/+64 | |
2022-08-09 | Rewrite algorithm to not pass around boxed functions | Titus Wormer | 1 | -23/+22 | |
* Pass state names from an enum around instead of boxed functions * Refactor to simplify attempts a lot * Use a subtokenizer for the the `document` content type | |||||
2022-08-02 | Refactor to remove most closures | Titus Wormer | 1 | -9/+13 | |
2022-07-29 | Refactor to work on bytes (`u8`) | Titus Wormer | 1 | -10/+10 | |
2022-07-28 | Refactor to work on `char`s | Titus Wormer | 1 | -12/+12 | |
Previously, a custom char implementation was used. This was easier to work with, as sometimes “virtual” characters are injected, or characters are ignored. This replaces that with working on actual `char`s. In the hope of in the future working on `u8`s, even. This simplifies the state machine somewhat, as only `\n` is fed, regardless of whether it was a CRLF, CR, or LF. It also feeds `' '` instead of virtual spaces. The BOM, if present, is now available as a `ByteOrderMark` event. | |||||
2022-07-26 | Refactor to drastically improve perf around whitespace | Titus Wormer | 1 | -26/+31 | |
2022-07-25 | Refactor to not pass codes around | Titus Wormer | 1 | -5/+5 | |
2022-07-25 | Remove no longer needed field in `State::Ok` | Titus Wormer | 1 | -1/+1 | |
2022-07-22 | Refactor to remove unneeded tuples in every states | Titus Wormer | 1 | -4/+4 | |
2022-07-22 | Refactor to pass ints instead of vecs around | Titus Wormer | 1 | -1/+1 | |
2022-07-21 | Refactor to improve performance by passing markers around | Titus Wormer | 1 | -46/+19 | |
2022-07-18 | Refactor to improve performance when constructs are turned off | Titus Wormer | 1 | -27/+48 | |
2022-07-18 | Add support for turning off constructs | Titus Wormer | 1 | -0/+1 | |
2022-07-05 | Refactor code style | Titus Wormer | 1 | -4/+4 | |
2022-07-04 | Add support for attention (emphasis, strong) | Titus Wormer | 1 | -7/+11 | |
2022-06-30 | Add support for trimming whitespace around string, text | Titus Wormer | 1 | -9/+13 | |
This commit introduces trimming initial and final whitespace around the whole string or text, or around line endings inside that string or text. * Add `register_resolver_before`, to run resolvers earlier than others, used for labels * Add resolver to merge `data` events, which are the most frequent token that occurs, and can happen adjacently. In `micromark-js` this sped up parsing a lot * Fix a bug where a virtual space was not seen as an okay event * Refactor to enable all turned off whitespace tests | |||||
2022-06-30 | Refactor some docs, fix some grammar | Titus Wormer | 1 | -2/+3 | |
2022-06-24 | Add link, images (resource) | Titus Wormer | 1 | -5/+12 | |
This is still some messy code that needs cleaning up, but it adds support for links and images, of the resource kind (`[a](b)`). References (`[a][b]`) are parsed and will soon be supported, but need matching. * Fix bug to pad percent-encoded bytes when normalizing urls * Fix bug with escapes counting as balancing in destination * Add `space_or_tab_one_line_ending`, to parse whitespace including up to one line ending (but not a blank line) * Add `ParserState` to share codes, definitions, etc | |||||
2022-06-22 | Refactor to improve tokenizer, add docs | Titus Wormer | 1 | -8/+10 | |
2022-06-21 | Make data a construct | Titus Wormer | 1 | -41/+12 | |
2022-06-16 | Refactor to reorder thing alphabetically | Titus Wormer | 1 | -1/+0 | |
2022-06-16 | Add support for hard break (trailing) | Titus Wormer | 1 | -6/+9 | |
2022-06-16 | Add support for hard break escape | Titus Wormer | 1 | -3/+4 | |
2022-06-15 | Add code (text) | Titus Wormer | 1 | -4/+6 | |
2022-06-14 | Fix support for deep subtokenization | Titus Wormer | 1 | -7/+14 | |
* Fix a couple of forgotten line ending handling in html (text) * Fix missing initial case for html (text) not having a `<` 😬 * Add line ending handling to `text` construct | |||||
2022-06-14 | Add docs for html (text) | Titus Wormer | 1 | -1/+1 | |
2022-06-13 | Add basic html (text) | Titus Wormer | 1 | -5/+9 | |
* Add all states for html (text) * Fix to link paragraph tokens together * Add note about uncovered bug where linking paragraph tokens together doesn’t work 😅 | |||||
2022-06-13 | Add autolinks | Titus Wormer | 1 | -4/+5 | |
2022-06-10 | Add text content type | Titus Wormer | 1 | -14/+16 | |
* Add character reference and character escapes in text * Add recursive subtokenization | |||||
2022-06-10 | Add proper support for subtokenization | Titus Wormer | 1 | -36/+6 | |
- Add “content” content type - Add paragraph - Add skips - Add linked tokens | |||||
2022-06-09 | Add basic subtokenization, string content in fenced code | Titus Wormer | 1 | -16/+4 | |
2022-06-09 | Refactor to pass more slices around | Titus Wormer | 1 | -1/+1 | |