
#tokenize

small, safe, and great CommonMark (optionally GFM, MDX) compliant markdown parser

JavaScript
1937
17 days ago

Boost Engine for Regulation and Security

C++
1174
3 years ago

CommonMark compliant markdown parser in Rust with ASTs and extensions

Rust
1130
2 months ago

Ungreedy subword tokenizer and vocabulary trainer for Python, Go & JavaScript

Go
576
10 months ago

mdast utility to parse markdown

JavaScript
243
1 month ago

snapdragon is an extremely pluggable, powerful and easy-to-use parser-renderer factory.

JavaScript
226
2 years ago

NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.

JavaScript
127
1 year ago

Tokenize2 is a plugin which allows your users to select multiple items from a predefined list or Ajax, using autocompletion as they type to find each item. You may have seen a similar type of text entry when filling in the recipients field while sending messages on Facebook or adding tags on Tumblr.

JavaScript
83
2 years ago

Example scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.

Jupyter Notebook
81
10 days ago

Extract JavaScript code comments from a string or glob of files.

JavaScript
49
6 years ago

bKash payment gateway integration in Flutter

Dart
41
6 months ago

Lexers, tokenizers, parsers, compilers, renderers, stringifiers... What's the difference, and how do they work?

22
8 years ago

A token based HTML Document parser and minifier written in PHP. Extract attribute values and text using CSS selectors.

PHP
21
4 months ago

A Python library for interacting with TI-(e)z80 (82/83/84 series) calculator files

Python
19
10 hours ago

Uses babel to extract JavaScript code comments from a string. Returns an array of comment objects, with line, column, index, comment type and comment string.

JavaScript
14
7 years ago

Uses snapdragon to tokenize a single JavaScript block comment into an object, with description, tags, and code example sections that can be passed to any other comment parsers for further parsing.

JavaScript
14
2 years ago

Implementation of a transformer NN block for machine translation, text classification, natural language inference, and machine reading comprehension models.

Python
11
1 year ago