aboutsummaryrefslogtreecommitdiffstats
path: root/src/construct/html_text.rs (unfollow)
Commit message (Collapse)AuthorFilesLines
2022-08-09Rewrite algorithm to not pass around boxed functionsLibravatar Titus Wormer1-89/+94
* Pass state names from an enum around instead of boxed functions * Refactor to simplify attempts a lot * Use a subtokenizer for the the `document` content type
2022-08-02Refactor to remove most closuresLibravatar Titus Wormer1-35/+60
2022-08-01Refactor some statesLibravatar Titus Wormer1-2/+2
2022-07-29Refactor to improve statesLibravatar Titus Wormer1-19/+27
* Remove custom kind wrappers, use plain bytes instead * Remove `Into`s, use the explicit expected types instead * Refactor to use `slice.as_str` in most places * Remove unneeded unique check before adding a definition * Use a shared CDATA prefix in constants * Inline byte checks into matches * Pass bytes back from parser instead of whole parse state * Refactor to work more often on bytes * Rename custom `size` to `len`
2022-07-29Refactor to work on bytes (`u8`)Libravatar Titus Wormer1-54/+54
2022-07-28Refactor to work on `char`sLibravatar Titus Wormer1-91/+70
Previously, a custom char implementation was used. This was easier to work with, as sometimes “virtual” characters are injected, or characters are ignored. This replaces that with working on actual `char`s. In the hope of in the future working on `u8`s, even. This simplifies the state machine somewhat, as only `\n` is fed, regardless of whether it was a CRLF, CR, or LF. It also feeds `' '` instead of virtual spaces. The BOM, if present, is now available as a `ByteOrderMark` event.
2022-07-25Refactor to not pass codes aroundLibravatar Titus Wormer1-147/+133
2022-07-25Remove no longer needed field in `State::Ok`Libravatar Titus Wormer1-1/+1
2022-07-22Refactor to remove unneeded tuples in every statesLibravatar Titus Wormer1-123/+93
2022-07-22Refactor to pass ints instead of vecs aroundLibravatar Titus Wormer1-56/+56
2022-07-18Add support for turning off constructsLibravatar Titus Wormer1-1/+1
2022-07-18Refactor examples of statesLibravatar Titus Wormer1-63/+77
2022-07-07Refactor to move token types to `token`Libravatar Titus Wormer1-11/+12
2022-07-05Refactor code styleLibravatar Titus Wormer1-43/+37
2022-06-22Add `attempt_opt` to tokenizerLibravatar Titus Wormer1-2/+2
2022-06-22Add docs for token typesLibravatar Titus Wormer1-0/+5
2022-06-21Refactor to improve a bunch of statesLibravatar Titus Wormer1-12/+8
* Improve passing stuff around * Add traits to enums for markers and such * Fix “life time” stuff I didn’t understand
2022-06-20Add improved whitespace handlingLibravatar Titus Wormer1-5/+4
* add several helpers for parsing betwen x and y `space_or_tab`s * use those helpers in a bunch of places * move initial indent parsing to flow constructs themselves
2022-06-20Remove unneeded `content` content typeLibravatar Titus Wormer1-3/+3
2022-06-20Remove unneeded `pub` keywordsLibravatar Titus Wormer1-32/+29
2022-06-14Fix support for deep subtokenizationLibravatar Titus Wormer1-10/+14
* Fix a couple of forgotten line ending handling in html (text) * Fix missing initial case for html (text) not having a `<` 😬 * Add line ending handling to `text` construct
2022-06-14Add docs for html (text)Libravatar Titus Wormer1-55/+272
2022-06-13Add basic html (text)Libravatar Titus Wormer1-0/+480
* Add all states for html (text) * Fix to link paragraph tokens together * Add note about uncovered bug where linking paragraph tokens together doesn’t work 😅