Description
This is a simple library to extract text from plaintext, .docx, .odt, .pdf, and .rtf files.
Programming language: Go
License: The Unlicense
Latest version: v0.1.5
cat alternatives and similar packages
Based on the "Specific Formats" category.
bluemonday
bluemonday: a fast golang HTML sanitizer (inspired by the OWASP Java HTML Sanitizer) to scrub user generated content of XSS
mxj
Decode / encode XML to/from map[string]interface{} (or JSON); extract values with dot-notation paths and wildcards. Replaces x2j and j2x packages.
omniparser
omniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc.
go-pkg-rss
This package reads RSS and Atom feeds and provides a caching mechanism that adheres to the feed specs.
html-to-markdown
⚙️ Convert HTML to Markdown. Even works with entire websites and can be extended through rules.
goq
A declarative struct-tag-based HTML unmarshaling or scraping package for Go built on top of the goquery library
xquery
XQuery lets you extract data from HTML/XML documents using XPath expression.
github_flavored_markdown
GitHub Flavored Markdown renderer with fenced code block highlighting, clickable header anchor links.
go-pkg-xmlx
Extension to the standard Go XML package. Maintains a node tree that allows forward/backwards browsing and exposes some simple single/multi-node search functions.
csvplus
csvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
pagser
Pagser is a simple, extensible, configurable parse and deserialize html page to struct based on goquery and struct tags for golang crawler
codetree
Parses indented code and returns a tree structure.
go-wildcard
Fast and light wildcard pattern matching. Fork from Minio project.
jsoncolor
Colorized JSON output for Go https://godoc.org/github.com/nwidger/jsoncolor
README
cat
This is a simple library to extract text from plaintext, .docx, .odt, .pdf, and .rtf files.
Install
go get -u github.com/lu4p/cat
Basic Usage
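package main

import (
	"fmt"
	"log"

	"github.com/lu4p/cat"
)

func main() {
	// Extract the text of the file; stop if extraction fails.
	txt, err := cat.File("filename")
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println(txt)
}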
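The basic usage above handles a single file. When processing several files, one unreadable file should usually not abort the rest. The following is a minimal sketch and not part of the cat README: `extractor` and `extractAll` are hypothetical names, and `readPlain` is a standard-library stand-in so the example runs without third-party modules. Assuming `cat.File` has the shape `func(string) (string, error)` that the basic usage suggests, it could be passed in place of `readPlain`.

```go
package main

import (
	"fmt"
	"os"
)

// extractor is a hypothetical type for any function that turns a file path
// into text. cat.File is assumed to fit this shape (see the lead-in).
type extractor func(filename string) (string, error)

// readPlain is a stand-in extractor that reads a file as plain text
// using only the standard library.
func readPlain(filename string) (string, error) {
	data, err := os.ReadFile(filename)
	if err != nil {
		return "", err
	}
	return string(data), nil
}

// extractAll runs extract on every file and keeps going after failures.
// It returns the extracted text and the errors, both keyed by filename.
func extractAll(filenames []string, extract extractor) (map[string]string, map[string]error) {
	texts := make(map[string]string)
	failed := make(map[string]error)
	for _, name := range filenames {
		txt, err := extract(name)
		if err != nil {
			failed[name] = err
			continue
		}
		texts[name] = txt
	}
	return texts, failed
}

func main() {
	// The paths to process are taken from the command-line arguments.
	texts, failed := extractAll(os.Args[1:], readPlain)
	for name, txt := range texts {
		fmt.Printf("%s: %d bytes of text\n", name, len(txt))
	}
	for name, err := range failed {
		fmt.Fprintf(os.Stderr, "%s: %v\n", name, err)
	}
}
```

Collecting failures in a separate map, instead of returning on the first error, lets the caller report every problem file after the batch finishes.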