Description
Package csvplus extends the standard Go encoding/csv package with a fluent interface, lazy stream-processing operations, indices and joins.
The library is primarily designed for ETL-like processes. It is most useful where the more advanced searching/joining capabilities of a fully-featured SQL database are not required, but at the same time the data transformations needed still include SQL-like operations.
csvplus alternatives and similar packages
Based on the "Specific Formats" category.
- sh: a shell parser, formatter, and interpreter with bash support; includes shfmt
- go-humanize: Go Humans! (formatters for units to human-friendly sizes)
- bluemonday: a fast golang HTML sanitizer (inspired by the OWASP Java HTML Sanitizer) to scrub user-generated content of XSS
- mxj: decode/encode XML to/from map[string]interface{} (or JSON); extract values with dot-notation paths and wildcards; replaces the x2j and j2x packages
- omniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc.
- html-to-markdown: ⚙️ convert HTML to Markdown; even works with entire websites and can be extended through rules
- go-pkg-rss: reads RSS and Atom feeds and provides a caching mechanism that adheres to the feed specs
- goribot: a simple golang spider/scraping framework; build a spider in 3 lines
- goq: a declarative struct-tag-based HTML unmarshaling or scraping package for Go, built on top of the goquery library
- xquery: extract data from HTML/XML documents using XPath expressions
- github_flavored_markdown: GitHub Flavored Markdown renderer with fenced code block highlighting and clickable header anchor links
- go-pkg-xmlx: extension to the standard Go XML package; maintains a node tree that allows forward/backward browsing and exposes simple single/multi-node search functions
- pagser: a simple, extensible, configurable parser that deserializes HTML pages into structs, based on goquery and struct tags, for golang crawlers
- gonameparts: takes a full name and splits it into individual name parts
- go-wildcard: fast and light wildcard pattern matching; a fork from the Minio project
- codetree: 🌲 parses indented code and returns a tree structure
- jsoncolor: colorized JSON output for Go (https://godoc.org/github.com/nwidger/jsoncolor)
README
csvplus
Package csvplus extends the standard Go encoding/csv package with a fluent interface, lazy stream-processing operations, indices and joins.
The library is primarily designed for ETL-like processes. It is most useful where the more advanced searching/joining capabilities of a fully-featured SQL database are not required, but at the same time the data transformations needed still include SQL-like operations.
License: BSD
Examples
Simple sequential processing:
people := csvplus.FromFile("people.csv").SelectColumns("name", "surname", "id")

err := csvplus.Take(people).
	Filter(csvplus.Like(csvplus.Row{"name": "Amelia"})).
	Map(func(row csvplus.Row) csvplus.Row { row["name"] = "Julia"; return row }).
	ToCsvFile("out.csv", "name", "surname")

if err != nil {
	return err
}
A more involved example:
customers := csvplus.FromFile("people.csv").SelectColumns("id", "name", "surname")

custIndex, err := csvplus.Take(customers).UniqueIndexOn("id")
if err != nil {
	return err
}

products := csvplus.FromFile("stock.csv").SelectColumns("prod_id", "product", "price")

prodIndex, err := csvplus.Take(products).UniqueIndexOn("prod_id")
if err != nil {
	return err
}

orders := csvplus.FromFile("orders.csv").SelectColumns("cust_id", "prod_id", "qty", "ts")
iter := csvplus.Take(orders).Join(custIndex, "cust_id").Join(prodIndex)

return iter(func(row csvplus.Row) error {
	// prints lines like:
	// John Doe bought 38 oranges for £0.03 each on 2016-09-14T08:48:22+01:00
	_, e := fmt.Printf("%s %s bought %s %ss for £%s each on %s\n",
		row["name"], row["surname"], row["qty"], row["product"], row["price"], row["ts"])
	return e
})
Design principles
The package functionality is built around operations on the following entities:
- type Row
- type DataSource
- type Index
Type Row
Row represents one row from a DataSource. It is a map from column names to the string values under those columns on the current row. The package expects a unique name to be assigned to every column at the source. Compared to using integer indices, this is more convenient when complex transformations get applied to each row during processing.
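For illustration, here is a minimal sketch of working with a Row directly. It assumes the map-like behaviour shown in the examples above; the import path is the project's canonical one, but verify it for your setup:

package main

import (
	"fmt"
	"strings"

	"github.com/maxim2266/csvplus" // assumed canonical import path
)

func main() {
	// A Row maps column names to the string values in those columns,
	// so cells are addressed by name rather than by position.
	row := csvplus.Row{"id": "42", "name": "Amelia", "surname": "Pond"}

	// Transformations read and update cells by column name.
	row["name"] = strings.ToUpper(row["name"])

	fmt.Println(row["id"], row["name"], row["surname"]) // 42 AMELIA Pond
}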
Type DataSource
DataSource represents any source of zero or more rows, such as a .csv file. It is a function that, when invoked, feeds the given callback with the data from its source, one Row at a time.
The type also has a number of operations defined on it that allow easy composition of operations on the DataSource, forming a so-called fluent interface.
All these operations are 'lazy', i.e. they are not performed immediately; instead, each of them returns a new DataSource.
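To make the callback model concrete, here is a sketch of a data source written by hand. The function shape below is an assumption inferred from the iterator call in the join example above (iter(func(row csvplus.Row) error { ... })); the exact type name and signature should be taken from the package documentation.

// sliceSource returns a hand-rolled data source that serves rows from a
// slice. The callback-based shape is an assumption, not the package's
// verbatim type definition.
func sliceSource(rows []csvplus.Row) func(fn func(csvplus.Row) error) error {
	return func(fn func(csvplus.Row) error) error {
		for _, row := range rows {
			if err := fn(row); err != nil {
				return err // a non-nil error from the callback stops the iteration
			}
		}
		return nil
	}
}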
There are also a number of convenience operations that actually invoke the DataSource function to produce a specific type of output (a usage sketch follows the list):
- IndexOn to build an index on the specified column(s);
- UniqueIndexOn to build a unique index on the specified column(s);
- ToCsv to serialise the DataSource to the given io.Writer in .csv format;
- ToCsvFile to store the DataSource in the specified file in .csv format;
- ToJSON to serialise the DataSource to the given io.Writer in JSON format;
- ToJSONFile to store the DataSource in the specified file in JSON format;
- ToRows to convert the DataSource to a slice of Rows.
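For example, using only operations already shown in this README, the pipeline below stays lazy until the terminal ToRows call. The file and column names are illustrative, and ToRows is assumed to return a slice of rows together with an error:

// Nothing is read from people.csv until ToRows is invoked.
people := csvplus.FromFile("people.csv").SelectColumns("id", "name")

rows, err := csvplus.Take(people).
	Filter(csvplus.Like(csvplus.Row{"name": "Amelia"})).
	ToRows()
if err != nil {
	return err
}

fmt.Println("matched", len(rows), "rows")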
Type Index
Index is a sorted collection of rows. The sorting is performed on the columns specified when the index is created, and iteration over an index yields a sorted sequence of rows. An Index can be joined with a DataSource. The type has operations for finding rows and creating sub-indices in O(log(n)) time; another useful operation is resolving duplicates. Building an index takes O(n*log(n)) time. It should be noted that building an Index requires the entire dataset to be read into memory, so some care should be taken when indexing huge datasets. An index can also be stored to, or loaded from, a disk file.
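As an illustration of the memory/reuse trade-off, the sketch below builds a unique index once and then reuses it for a join, using only calls already shown in the examples above. The row-lookup and sub-index methods mentioned in the previous paragraph are not shown here; their names should be taken from the package documentation.

// Building the index reads all of stock.csv into memory: O(n*log(n)).
products := csvplus.FromFile("stock.csv").SelectColumns("prod_id", "product", "price")

prodIndex, err := csvplus.Take(products).UniqueIndexOn("prod_id")
if err != nil {
	return err
}

// The same index can now back any number of joins without being rebuilt.
orders := csvplus.FromFile("orders.csv").SelectColumns("prod_id", "qty")
iter := csvplus.Take(orders).Join(prodIndex)

return iter(func(row csvplus.Row) error {
	fmt.Println(row["product"], row["qty"])
	return nil
})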
For more details see the documentation.
Project status
The project is in a usable state usually called "beta". Tested on Linux Mint 18.3 using Go version 1.10.2.
*Note that all licence references and agreements mentioned in the csvplus README section above
are relevant to that project's source code only.