10

8

6

4

2


9.9

7.9

9.7

5.4

9.2
0.0

9.2

8.0
sh

9.0

7.6

8.8

4.9

96 Text Processing packages and projects

  • micro-editor

    9.9 7.9 Go
    A modern and intuitive terminal-based text editor
  • GoQuery

    9.7 5.4 Go
    A little like that j-thing, only in Go.
  • Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
  • blackfriday

    9.2 0.0 Go
    Blackfriday: a markdown processor for Go
  • sh

    9.2 8.0 Go
    A shell parser, formatter, and interpreter with bash support; includes shfmt
  • toml

    9.0 7.6 Go
    TOML parser for Golang with reflection.
  • go-humanize

    8.8 4.9 Go
    Go Humans! (formatters for units to human friendly sizes)
  • goldmark

    8.6 7.4 Go
    :trophy: A markdown parser written in Go. Easy to extend, standard(CommonMark) compliant, well structured.
  • bluemonday

    8.5 6.6 Go
    bluemonday: a fast golang HTML sanitizer (inspired by the OWASP Java HTML Sanitizer) to scrub user generated content of XSS
  • gofeed

    8.3 5.7 Go
    Parse RSS, Atom and JSON feeds in Go
  • inject

    7.9 0.0 Go
    Package inject provides a reflect based injector.
  • xurls

    7.5 6.9 Go
    Extract urls from text
  • slug

    7.4 0.0 Go
    URL-friendly slugify with multiple languages support.
  • commonregex

    7.2 0.0 Go
    🍫 A collection of common regular expressions for Go
  • html-to-markdown

    7.0 6.4 Go
    ⚙️ Convert HTML to Markdown. Even works with entire websites and can be extended through rules.
  • htmlquery

    6.9 2.2 Go
    htmlquery is golang XPath package for HTML query.
  • mxj

    6.9 2.3 Go
    Decode / encode XML to/from map[string]interface{} (or JSON); extract values with dot-notation paths and wildcards. Replaces x2j and j2x packages.
  • Dataflow kit

    6.9 0.0 Go
    Extract structured data from web sites. Web sites scraping.
  • go-runewidth

    6.8 2.6 Go
    wcwidth for golang
  • xpath

    6.8 3.2 Go
    XPath package for Golang, supports HTML, XML, JSON document query.
  • omniparser

    6.7 4.6 Go
    omniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc.
  • gographviz

    6.6 1.4 Go
    Parses the Graphviz DOT language in golang
  • Koazee

    6.5 0.0 Go
    A StreamLike, Immutable, Lazy Loading and smart Golang Library to deal with slices.
  • go-pkg-rss

    6.4 0.0 Go
    This package reads RSS and Atom feeds and provides a caching mechanism that adheres to the feed specs.
  • gotext

    6.3 5.6 Go
    Go (Golang) GNU gettext utilities package
  • go-edlib

    6.2 1.8 Go
    📚 String comparison and edit distance algorithms library, featuring : Levenshtein, LCS, Hamming, Damerau levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc...
  • gotabulate

    5.8 0.0 Go
    Gotabulate - Easily pretty-print your tabular data with Go
  • go-nmea

    5.7 3.0 Go
    A NMEA parser library in pure Go
  • goribot

    5.5 6.1 Go
    A simple golang spider/scraping framework,build a spider in 3 lines.
  • strutil-go

    5.4 4.5 Go
    Golang metrics for calculating string similarity and other string utility functions
  • goq

    5.4 0.0 Go
    A declarative struct-tag-based HTML unmarshaling or scraping package for Go built on top of the goquery library
  • xquery

    5.3 0.0 Go
    XQuery lets you extract data from HTML/XML documents using XPath expression.
  • gospider

    5.2 3.6 Go
    ⚡ Light weight Golang spider framework | 轻量的 Golang 爬虫框架
  • github_flavored_markdown

    5.2 0.0 Go
    GitHub Flavored Markdown renderer with fenced code block highlighting, clickable header anchor links.
  • go-pkg-xmlx

    5.1 0.0 Go
    Extension to the standard Go XML package. Maintains a node tree that allows forward/backwards browsing and exposes some simple single/multi-node search functions.
  • editorconfig-core-go

    5.0 6.7 Go
    EditorConfig Core written in Go
  • radix

    5.0 0.0 Go
    A fast string sorting algorithm (MSD radix sort)
  • sdp

    4.9 0.0 Go
    RFC 4566 SDP implementation in go
  • shell2telegram

    4.9 4.6 Go
    Telegram bot constructor from command-line
  • podcast

    4.8 0.0 Go
    iTunes and RSS 2.0 Podcast Generator in Golang
  • go-vcard

    4.6 4.4 Go
    A Go library to parse and format vCard
  • did

    4.5 0.0 Go
    A golang package to work with Decentralized Identifiers (DIDs)
  • regroup

    4.5 4.2 Go
    Match regex group into go struct using struct tags and automatic parsing
  • go-fixedwidth

    4.3 4.4 Go
    Encoding and decoding for fixed-width formatted data
  • goregen

    4.2 0.0 Go
    randexp for Go.
  • go-zero-width

    4.2 0.0 Go
    Zero-width character detection and removal for Go
  • cat

    4.1 3.8 Go
    Extract text from plaintext, .docx, .odt and .rtf files. Pure go.
  • go-slugify

    3.9 0.0 Go
    Pretty Slug.
  • pagser

    3.9 2.7 Go
    Pagser is a simple, extensible, configurable parse and deserialize html page to struct based on goquery and struct tags for golang crawler
  • bafi

    3.8 6.4 Go
    Universal JSON, BSON, YAML, CSV, XML, mt940 converter with templates
  • align

    3.8 1.8 Go
    A general purpose application and library for aligning text.
  • Ren'Py graph vizualiser

    3.8 5.6 Go
    Draws a flowchart graph of any Visual Novel from Renpy .rpy files !
  • genex

    3.7 0.0 Go
    Genex package for Go
  • frontmatter

    3.7 4.2 Go
    Go library for detecting and decoding various content front matter formats
  • go-wildcard

    3.7 5.6 Go
    🚀 Fast and light wildcard pattern matching.
  • ODF

    3.6 2.8 Go
    Open Document Format (ODF) generator library for Go.
  • Go Typed Redis Streams

    3.5 6.8 Go
    Turn Redis streams into typed Go channels in just a few lines
  • tokenizer

    3.4 3.6 Go
    Tokenizer (lexer) for golang
  • csvplus

    3.3 0.0 Go
    csvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
  • allot

    3.2 0.0 Go
    Parse placeholder and wildcard text commands
  • gofuckyourself

    3.2 0.0 Go
    A sanitization-based swear filter for Go.
  • Goa

    3.2 0.0 Go
    The Go library that will drive you to AOP world!
  • html2data

    3.1 4.5 Go
    Library and cli for extracting data from HTML via CSS selectors
  • parth

    2.9 0.0 Go
    Path parsing for segment unmarshaling and slicing.
  • guesslanguage

    2.9 0.0 Go
    Guess the natural language of a text in Go
  • xj2go

    2.7 3.6 Go
    Convert xml and json to go struct
  • GO Transliterator

    2.7 0.0 Go
    Golang text Transliterator (i.e München -> Muenchen)
  • Go Mathematical Expression Toolkit

    2.6 0.0 Go
    Go Mathematical Expression Toolkit. Run-time mathematical expression parser and evaluation engine.
  • gonameparts

    2.5 0.0 Go
    Takes a full name and splits it into individual name parts
  • normalize

    2.5 0.0 Go
    Sanitize, normalize and compare fuzzy text.
  • Slugify

    2.4 0.0 Go
    A Go slugify application that handles string
  • Edifact

    2.3 5.8 Go
    Read edifact into your Go data structures.
  • Tagify

    2.2 4.7 HTML
    Tagify produces a set of tags from a given source. Source can be either an HTML page, a Markdown document or a plain text. Supports English, Russian, Chinese, Hindi, Spanish, Arabic, Japanese, German, Hebrew, French and Korean languages.
  • codetree

    2.1 0.0 Go
    :evergreen_tree: Parses indented code and returns a tree structure.
  • go-fasttld

    1.9 7.5 Go
    go-fasttld is a high performance effective top level domains (eTLD) extraction module.
  • go-recipe

    1.8 1.4 Go
    Go package for scraping website recipes
  • kace

    1.7 0.0 Go
    Common case conversions covering common initialisms.
  • jsoncolor

    1.7 0.7 Go
    Colorized JSON output for Go https://godoc.org/github.com/nwidger/jsoncolor
  • TySug

    1.6 1.9 Go
    A project around helping to prevent typing typos. TySug (Typo Suggestions) suggests alternative words with respect to keyboard layouts
  • enca

    1.6 0.0 Go
    Minimal cgo bindings for libenca
  • syndfeed

    1.4 0.0 Go
    A syndication feed parser for Atom 1.0 and RSS 2.0 in Go
  • parseargs-go

    1.4 0.0 Go
    A string argument parser that understands quotes and backslashes
  • jsonpath

    1.2 6.1 Go
    JSONPath with dot notation generator for golang
  • textwrap

    1.1 0.0 Go
    Port of Python's "textwrap" module to Go
  • bbConvert

    1.1 3.2 Go
    Converter from BBCode to HTML
  • go-output-format

    1.1 0.0 Go
    Output go objects in standard formats, such as YAML, JSON, etc
  • htmlyaml

    1.0 3.7 Go
    🐹 render YAML as HTML in Go
  • Bookgot

    1.0 0.0 Go
    Simple script for farm free books from PackPub.com
  • ltsv

    0.9 0.0 Go
    High performance LTSV (Labeled Tab Separeted Value) reader for Go.
  • encdec

    0.8 0.0 Go
    Go package provides a generic interface to encoders and decoders
  • doi

    0.8 0.0 Go
    Parse and check doi objects in go.
  • Paragraphical

    0.7 0.0 Go
    A simple library for formatting and laying out text in your terminal
  • walker

    0.7 0.0 Go
    Seamlessly fetch paginated data from any source. Simple and high performance API scraping included!
  • xml

    0.4 0.0 Go
    Package feed implements a flexible, robust and efficient RSS and Atom parser
  • godazo

    0.4 0.0 Go
    Stupid simple slide presenter or static site creator.
  • gommon/bytes

    0.2 -
    Format bytes to string.
  • Markov Chain Algorithm

    0.2 0.0 Go
    A Markov chain algorithm generates text by creating a statistical model of potential textual suffixes for a given prefix.

Add another 'Text Processing' Package