82 Text Processing packages and projects
-
GoQuery
9.7 5.3 GoGoQuery brings a syntax and a set of features similar to jQuery to the Go language. -
mxj
6.9 5.7 GoEncode / decode XML as JSON or map[string]interface{}; extract values with dot-notation paths and wildcards. Replaces x2j and j2x packages. -
go-pkg-rss
6.4 0.0 GoThis package reads RSS and Atom feeds and provides a caching mechanism that adheres to the feed specs. -
htmlquery
6.2 1.2 GoAn XPath query package for HTML, lets you extract data or evaluate from HTML documents by an XPath expression. -
go-edlib
5.6 7.4 GoGo string comparison and edit distance algorithms library (Levenshtein, LCS, Hamming, Damerau levenshtein, Jaro-Winkler, etc.) compatible with Unicode. -
goq
5.3 0.0 GoDeclarative unmarshaling of HTML using struct tags with jQuery syntax (uses GoQuery). -
html-to-markdown
5.2 6.1 GoConvert HTML to Markdown. Even works with entire websites and can be extended through rules. -
go-pkg-xmlx
5.1 0.0 GoExtension to the standard Go XML package. Maintains a node tree that allows forward/backwards browsing and exposes some simple single/multi-node search functions. -
github_flavored_markdown
5.0 0.0 GoGitHub Flavored Markdown renderer with fenced code block highlighting, clickable header anchor links. -
omniparser
4.8 7.8 GoA versatile ETL library that parses text input (CSV/txt/JSON/XML/EDI/X12/EDIFACT/etc) in streaming fashion and transforms data into JSON output using data-driven schema. -
regroup
3.5 6.3 GoMatch regex expression named groups into go struct using struct tags and automatic parsing. -
csvplus
3.4 0.1 Gocsvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins. -
pagser
2.1 5.7 GoPagser is a simple, extensible, configurable parse and deserialize html page to struct based on goquery and struct tags for golang crawler. -
codetree
2.0 0.0 GoParses indented code (python, pixy, scarlet, etc.) and returns a tree structure. -
Markov Chain Algorithm
0.3 0.0 GoA Markov chain algorithm generates text by creating a statistical model of potential textual suffixes for a given prefix.
Get performance insights in less than 4 minutes
Scout APM uses tracing logic that ties bottlenecks to source code so you know the exact line of code causing performance issues and can get back to building a great product faster.
Sponsored
scoutapm.com