TX-trie

Tx: Succinct Trie Data structure

Tx is a library for a compact trie data structure. Tx requires 1/4 - 1/10 of the memory usage compared to the previous implementations, and can therefore handle quite a large number of keys (e.g. 1 billion) efficiently. A trie data structure supports exact matching and common prefix matching, which are used for natural language processing etc. Tx uses Level-Order Unary Degree Sequence (LOUDS) for trie representation.

How to build

$ ./autogen.sh $ ./configure $ make

USAGE

build index

[wordlist_file]: Word list file name. One word per line.

Ex)

apple
orange
banana

[index_file]: output index file name.

$ txbuild [wordlist_file] [index_file]
word list 3 elements
outputSize:56 inputSize:17 ratio:3.29412

listup words

$ txlist [index_file]
apple
banana
orange

search

$ txsearch [index_file]
keyNum:3 nodeNum:18
>apple
query:apple
5
prefixSearch id:0 len:5 lookup:apple
expansionSearch 1
apple
commonPrefixSearch 1
apple (id=0)
predictiveSearch 1
apple (id=0)
>pearch
query:pearch
6
prefixSearch not found
expansionSearch 0
commonPrefixSearch 0
predictiveSearch 0
>

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.gitignore		.gitignore
LICENSE		LICENSE
Makefile.am		Makefile.am
README.md		README.md
autogen.sh		autogen.sh
configure.ac		configure.ac
s2sbuild.cpp		s2sbuild.cpp
s2ssearch.cpp		s2ssearch.cpp
ssv.cpp		ssv.cpp
ssv.hpp		ssv.hpp
tx.cpp		tx.cpp
tx.hpp		tx.hpp
tx.pc.in		tx.pc.in
txbuild.cpp		txbuild.cpp
txlist.cpp		txlist.cpp
txsearch.cpp		txsearch.cpp
txsearch_mmap.cpp		txsearch_mmap.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TX-trie

How to build

USAGE

build index

listup words

search

About

Releases

Packages

Contributors 5

Languages

License

retrieva/tx-trie

Folders and files

Latest commit

History

Repository files navigation

TX-trie

How to build

USAGE

build index

listup words

search

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages