whatlanggo alternatives and similar packages
Based on the "Natural Language Processing" category.
Alternatively, view whatlanggo alternatives based on common mentions on social networks and blogs.
- prose: :book: A Golang library for text processing, including tokenization, part-of-speech tagging, and named-entity extraction.
- gse: Go efficient multilingual NLP and text segmentation; supports English, Chinese, Japanese, and others.
- spaGO: Self-contained Machine Learning and Natural Language Processing library in Go.
- kagome: Self-contained Japanese Morphological Analyzer written in pure Go.
- nlp: [UNMAINTAINED] Extract values from strings and fill your structs with nlp.
- universal-translator: :speech_balloon: i18n Translator for Go/Golang using CLDR data + pluralization rules.
- locales: :earth_americas: A set of locales generated from the CLDR Project which can be used independently or within an i18n package; built for use with, but not exclusive to, https://github.com/go-playground/universal-translator.
- RAKE.go: A Go port of the Rapid Automatic Keyword Extraction (RAKE) algorithm.
- go-nlp: Utilities for working with discrete probability distributions and other tools useful for doing NLP work.
- segment: A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29.
- textcat: A Go package for n-gram based text categorization, with support for UTF-8 and raw text.
- go-localize: i18n (internationalization and localization) engine written in Go, used for translating locale strings.
- stemmer: Stemmer packages for the Go programming language. Includes English, German, and Dutch stemmers.
- petrovich: Golang port of Petrovich, an inflector for Russian anthroponyms.
- paicehusk: Golang implementation of the Paice/Husk stemming algorithm.
- go-tinydate: A tiny date object in Go. Tinydate uses only 4 bytes of memory.
- golibstemmer: Go bindings for the Snowball libstemmer library, including Porter 2.
- gotokenizer: A tokenizer based on dictionary and bigram language models for Go. (Currently only supports Chinese segmentation.)
- spreak: Flexible translation and humanization library for Go, based on the concepts behind gettext.
- gosentiwordnet: 💬 Sentiment analyzer library using SentiWordNet in Go.
README
Whatlanggo
Natural language detection for Go.
Features
- Supports 84 languages
- 100% written in Go
- No external dependencies
- Fast
- Recognizes not only a language, but also a script (Latin, Cyrillic, etc.)
Getting started
Installation:
```shell
go get -u github.com/abadojack/whatlanggo
```
Simple usage example:
```go
package main

import (
	"fmt"

	"github.com/abadojack/whatlanggo"
)

func main() {
	info := whatlanggo.Detect("Foje funkcias kaj foje ne funkcias")
	fmt.Println("Language:", info.Lang.String(), " Script:", whatlanggo.Scripts[info.Script], " Confidence: ", info.Confidence)
}
```
Blacklisting and whitelisting
```go
package main

import (
	"fmt"

	"github.com/abadojack/whatlanggo"
)

func main() {
	// Blacklist
	options := whatlanggo.Options{
		Blacklist: map[whatlanggo.Lang]bool{
			whatlanggo.Ydd: true,
		},
	}
	info := whatlanggo.DetectWithOptions("האקדמיה ללשון העברית", options)
	fmt.Println("Language:", info.Lang.String(), "Script:", whatlanggo.Scripts[info.Script])

	// Whitelist
	options1 := whatlanggo.Options{
		Whitelist: map[whatlanggo.Lang]bool{
			whatlanggo.Epo: true,
			whatlanggo.Ukr: true,
		},
	}
	info = whatlanggo.DetectWithOptions("Mi ne scias", options1)
	fmt.Println("Language:", info.Lang.String(), " Script:", whatlanggo.Scripts[info.Script])
}
```
For more details, please check the documentation.
Requirements
Go 1.8 or higher
How does it work?
How does the language recognition work?
The algorithm is based on trigram language models, a particular case of n-grams. To understand the idea, please check the original paper by Cavnar and Trenkle, "N-Gram-Based Text Categorization" (1994).
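To make the trigram idea concrete, here is a minimal sketch in Go of extracting a trigram frequency profile and comparing two texts by shared trigrams. This is an illustration of the Cavnar and Trenkle approach, not whatlanggo's actual implementation; the function names are invented for this example, and a real detector compares rank-ordered profiles against precomputed per-language models.

```go
package main

import (
	"fmt"
	"strings"
)

// trigrams returns the frequency of each three-rune sequence in text.
func trigrams(text string) map[string]int {
	runes := []rune(strings.ToLower(text))
	counts := make(map[string]int)
	for i := 0; i+3 <= len(runes); i++ {
		counts[string(runes[i:i+3])]++
	}
	return counts
}

// similarity counts trigrams that two texts have in common; the text is
// attributed to whichever language model it shares the most trigrams with.
func similarity(a, b string) int {
	ta, tb := trigrams(a), trigrams(b)
	shared := 0
	for g := range ta {
		if _, ok := tb[g]; ok {
			shared++
		}
	}
	return shared
}

func main() {
	esperanto := "foje funkcias kaj foje ne funkcias"
	fmt.Println("unique trigrams:", len(trigrams(esperanto)))
	fmt.Println("esperanto-ish text scores higher:",
		similarity("funkcias bone", esperanto) > similarity("привет мир", esperanto))
}
```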
How is IsReliable calculated?
It is based on the following factors:
- How many unique trigrams are in the given text
- How big the difference is between the first and the second (not returned) detected languages. This metric is called rate in the code base.

Therefore, it can be represented as a 2D space with a threshold function that splits it into "Reliable" and "Not reliable" areas. This threshold function is a hyperbola.
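The hyperbola-shaped threshold can be sketched as follows. The constants and the function name here are made up for demonstration; whatlanggo uses its own tuned threshold. The idea is simply that the fewer unique trigrams a text has, the larger the gap (rate) between the top two language scores must be before the result counts as reliable.

```go
package main

import "fmt"

// isReliable is an illustrative reliability check: the threshold on the
// score gap falls hyperbolically as the number of unique trigrams grows.
func isReliable(uniqueTrigrams int, rate float64) bool {
	if uniqueTrigrams == 0 {
		return false
	}
	// Hyperbola: threshold = 1 / uniqueTrigrams (constant chosen for demo only).
	threshold := 1.0 / float64(uniqueTrigrams)
	return rate > threshold
}

func main() {
	fmt.Println(isReliable(3, 0.5))    // short text, big gap between languages
	fmt.Println(isReliable(100, 0.05)) // long text tolerates a small gap
	fmt.Println(isReliable(3, 0.1))    // short text, small gap: not reliable
}
```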
For more details, please check the blog article "Introduction to Rust Whatlang Library and Natural Language Identification Algorithms".
License
Derivation
whatlanggo is a derivative of Franc (JavaScript, MIT) by Titus Wormer.
Acknowledgements
Thanks to greyblake (Potapov Sergey) for creating whatlang-rs from where I got the idea and algorithms.
*Note that all licence references and agreements mentioned in the whatlanggo README section above
are relevant to that project's source code only.