shamoji alternatives and similar packages
Based on the "Natural Language Processing" category.
- prose: A Go library for text processing, including tokenization, part-of-speech tagging, and named-entity extraction.
- gse: Efficient multilingual NLP and text segmentation in Go; supports English, Chinese, Japanese, and others.
- spaGO: Self-contained machine learning and natural language processing library in Go.
- nlp: [UNMAINTAINED] Extract values from strings and fill your structs with nlp.
- universal-translator: i18n translator for Go using CLDR data and pluralization rules.
- locales: A set of locales generated from the CLDR Project that can be used independently or within an i18n package; built for use with, but not exclusive to, https://github.com/go-playground/universal-translator
- RAKE.go: A Go port of the Rapid Automatic Keyword Extraction (RAKE) algorithm.
- segment: A Go library for performing Unicode text segmentation as described in Unicode Standard Annex #29.
- go-nlp: Utilities for working with discrete probability distributions and other tools useful for NLP work.
- textcat: A Go package for n-gram based text categorization, with support for UTF-8 and raw text.
- go-localize: i18n (internationalization and localization) engine written in Go, used for translating locale strings.
- stemmer: Stemmer packages for the Go programming language. Includes English, German, and Dutch stemmers.
- petrovich: Go port of Petrovich, an inflector for Russian anthroponyms.
- go-tinydate: A tiny date object in Go. Tinydate uses only 4 bytes of memory.
- golibstemmer: Go bindings for the Snowball libstemmer library, including Porter 2.
- gotokenizer: A tokenizer for Go based on a dictionary and bigram language models. (Currently supports only Chinese segmentation.)
- spreak: Flexible translation and humanization library for Go, based on the concepts behind gettext.
- gosentiwordnet: 💬 Sentiment analyzer library using SentiWordNet in Go.
README
shamoji
About
shamoji (杓文字) is a word-filtering package for Go.
Install
$ go get -u github.com/osamingo/shamoji
Usage
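package main

import (
	"fmt"
	"sync"

	"github.com/osamingo/shamoji"
	"github.com/osamingo/shamoji/filter"
	"github.com/osamingo/shamoji/tokenizer"
	"golang.org/x/text/unicode/norm"
)

var (
	o sync.Once      // guards one-time construction of s
	s *shamoji.Serve // shared filtering service
)

func main() {
	yes, word := Contains("我が生涯に一片の悔い無し")
	fmt.Printf("Result: %v, Word: %s\n", yes, word)
}

// Contains reports whether sentence contains a filtered word and,
// if so, which word matched.
func Contains(sentence string) (bool, string) {
	// Build the service lazily, exactly once, even under concurrent calls.
	o.Do(func() {
		s = &shamoji.Serve{
			// Tokenize with Kagome after NFKC normalization.
			Tokenizer: tokenizer.NewKagomeSimpleTokenizer(norm.NFKC),
			// The field is spelled "Filer" in this README.
			Filer: filter.NewCuckooFilter("涯に", "悔い"),
		}
	})
	return s.Do(sentence)
}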
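The usage example pairs a tokenizer with a filter. The following is a minimal, standard-library-only sketch of that general idea, not shamoji's implementation. All names in it (Tokenizer, Filter, ngramTokenizer, setFilter, contains) are hypothetical and chosen for illustration. It assumes a naive rune n-gram tokenizer in place of Kagome and an exact-match set in place of a cuckoo filter; a cuckoo filter is probabilistic and may report false positives, which an exact set does not.

```go
package main

import "fmt"

// Tokenizer splits a sentence into candidate tokens (hypothetical interface).
type Tokenizer interface {
	Tokenize(sentence string) []string
}

// Filter reports whether a token is on the block list (hypothetical interface).
type Filter interface {
	Test(token string) bool
}

// ngramTokenizer emits every rune n-gram of length 1..max.
// It is a naive stand-in for a morphological tokenizer.
type ngramTokenizer struct{ max int }

func (t ngramTokenizer) Tokenize(sentence string) []string {
	runes := []rune(sentence)
	var out []string
	for i := range runes {
		for n := 1; n <= t.max && i+n <= len(runes); n++ {
			out = append(out, string(runes[i:i+n]))
		}
	}
	return out
}

// setFilter is an exact-match set of blocked words.
type setFilter map[string]struct{}

func newSetFilter(words ...string) setFilter {
	f := make(setFilter, len(words))
	for _, w := range words {
		f[w] = struct{}{}
	}
	return f
}

func (f setFilter) Test(token string) bool {
	_, ok := f[token]
	return ok
}

// contains returns true and the first blocked token found in sentence,
// or false and an empty string if no token is blocked.
func contains(t Tokenizer, f Filter, sentence string) (bool, string) {
	for _, tok := range t.Tokenize(sentence) {
		if f.Test(tok) {
			return true, tok
		}
	}
	return false, ""
}

func main() {
	t := ngramTokenizer{max: 3}
	f := newSetFilter("涯に", "悔い")
	ok, word := contains(t, f, "我が生涯に一片の悔い無し")
	fmt.Printf("Result: %v, Word: %s\n", ok, word) // prints "Result: true, Word: 涯に"
}
```

Keeping tokenization and filtering behind separate interfaces means either half can be swapped independently, for example a different tokenizer for another language or a different set structure for a larger word list.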
License
Released under the MIT License.
*Note that all licence references and agreements mentioned in the shamoji README section above
are relevant to that project's source code only.