Package: ngram 3.2.3

ngram: Fast n-Gram 'Tokenization'

An n-gram is a sequence of n "words" taken, in order, from a body of text. This is a collection of utilities for creating, displaying, summarizing, and "babbling" n-grams. The 'tokenization' and "babbling" are handled by very efficient C code, which can even be built as its own standalone library. The babbler is a simple Markov chain. The package also offers a vignette with complete example 'workflows' and information about the utilities offered in the package.

Authors:Drew Schmidt [aut, cre], Christian Heckendorf [aut]

ngram_3.2.3.tar.gz
ngram_3.2.3.zip(r-4.5)ngram_3.2.3.zip(r-4.4)ngram_3.2.3.zip(r-4.3)
ngram_3.2.3.tgz(r-4.5-x86_64)ngram_3.2.3.tgz(r-4.5-arm64)ngram_3.2.3.tgz(r-4.4-x86_64)ngram_3.2.3.tgz(r-4.4-arm64)ngram_3.2.3.tgz(r-4.3-x86_64)ngram_3.2.3.tgz(r-4.3-arm64)
ngram_3.2.3.tar.gz(r-4.5-noble)ngram_3.2.3.tar.gz(r-4.4-noble)
ngram_3.2.3.tgz(r-4.4-emscripten)ngram_3.2.3.tgz(r-4.3-emscripten)
ngram.pdf |ngram.html✨
ngram/json (API)

# Install 'ngram' in R:

install.packages('ngram', repos = c('https://wrathematics.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/wrathematics/ngram/issues

On CRAN:

ngram text text-mining

10.45 score 71 stars 7 packages 844 scripts 3.0k downloads 5 mentions 18 exports 0 dependencies

Last updated 1 years agofrom:99ebbc3790. Checks:12 OK. Indexed: yes.

Target	Result	Latest binary
Doc / Vignettes	OK	Mar 04 2025
R-4.5-win-x86_64	OK	Mar 04 2025
R-4.5-mac-x86_64	OK	Mar 04 2025
R-4.5-mac-aarch64	OK	Mar 04 2025
R-4.5-linux-x86_64	OK	Mar 04 2025
R-4.4-win-x86_64	OK	Mar 04 2025
R-4.4-mac-x86_64	OK	Mar 04 2025
R-4.4-mac-aarch64	OK	Mar 04 2025
R-4.4-linux-x86_64	OK	Mar 04 2025
R-4.3-win-x86_64	OK	Mar 04 2025
R-4.3-mac-x86_64	OK	Mar 04 2025
R-4.3-mac-aarch64	OK	Mar 04 2025

Exports:babble concatenate get.nextwords get.ngrams get.phrasetable get.string getseed multiread ng_order ngram ngram_asweka preprocess print rcorpus show splitter string.summary wordcount

Dependencies:

Guide to the ngram Package

Rendered fromngram-guide.Rnwusingutils::Sweaveon Mar 04 2025.

Last update: 2022-03-13
Started: 2014-06-16

Citation

Development and contributors

Readme and manuals

Help Manual

Help page	Topics
ngram: Fast n-Gram Tokenization	ngram-package
ngram Babbler	babble babble,ngram-method
Concatenate	concatenate
getseed	getseed
ngram Getters	get.nextwords get.nextwords,ngram-method get.ngrams get.ngrams,ngram-method get.string get.string,ngram-method getters ng_order ng_order,ngram-method
Multiread	multiread
n-gram Tokenization	ngram tokenize
Class ngram	ngram-class
ngram printing	ngram-print print,ngram-method show,ngram-method
Get Phrasetable	get.phrasetable phrasetable
Basic Text Preprocessor	preprocess
Random Corpus	rcorpus
Character Splitter	splitter
Text Summary	string.summary
Weka-like n-gram Tokenization	ngram_asweka Tokenize-AsWeka
wordcount	wordcount wordcount.character wordcount.ngram