aboutsummaryrefslogtreecommitdiffstats
path: root/src/subtokenize.rs (follow)
Commit message (Expand)AuthorAgeFilesLines
* Add support for unicode punctuationLibravatar Titus Wormer2022-07-041-1/+1
* Update list of todosLibravatar Titus Wormer2022-07-041-2/+0
* Fix jumps in `edit_map`Libravatar Titus Wormer2022-06-281-101/+99
* Add link, images (resource)Libravatar Titus Wormer2022-06-241-12/+26
* Refactor some unneeded assignmentsLibravatar Titus Wormer2022-06-221-2/+1
* Add docs for token typesLibravatar Titus Wormer2022-06-221-1/+3
* Add docs for `subtokenize`Libravatar Titus Wormer2022-06-211-2/+51
* Update todo listLibravatar Titus Wormer2022-06-211-8/+1
* Add support for BOMLibravatar Titus Wormer2022-06-201-0/+4
* Remove unneeded `content` content typeLibravatar Titus Wormer2022-06-201-6/+3
* Fix support for deep subtokenizationLibravatar Titus Wormer2022-06-141-9/+19
* Reorganize to split utilLibravatar Titus Wormer2022-06-141-6/+4
* Add docs for html (text)Libravatar Titus Wormer2022-06-141-0/+1
* Add basic html (text)Libravatar Titus Wormer2022-06-131-3/+9
* Add text content typeLibravatar Titus Wormer2022-06-101-4/+10
* Add proper support for subtokenizationLibravatar Titus Wormer2022-06-101-50/+116
* Add basic subtokenization, string content in fenced codeLibravatar Titus Wormer2022-06-091-0/+67