Skip to content

Commit 8d9badd

Browse files
Downloaded from http://dhost.info/paq8/
0 parents  commit 8d9badd

File tree

223 files changed

+1361652
-0
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

223 files changed

+1361652
-0
lines changed

fp8_v1/fp8_v1.cpp

+4,289
Large diffs are not rendered by default.

fp8_v2/fp8_v2.cpp

+4,303
Large diffs are not rendered by default.

fp8_v3/fp8_v3.cpp

+4,312
Large diffs are not rendered by default.

paq8a/paq8a.cpp

+2,886
Large diffs are not rendered by default.

paq8b/TextFilter/WRT-ENG.DIC

+107,694
Large diffs are not rendered by default.

paq8b/TextFilter/readme.txt

+44
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,44 @@
1+
PAQ8B = PAQ8A + TextFilter 1.2 for PAQ by P.Skibinski (based on WRT 4.5)
2+
3+
1. If you're compressing multi-lingual files you can put additional dictionaries
4+
into the "TextFilter" directory (they are auto-detected). For now there are
5+
4 additional dictionaries available: German, French, Polish and Russian:
6+
WRT-DE.DIC, 463918 bytes (available at http://www.ii.uni.wroc.pl/~inikep/research/wrt40-de.rar)
7+
WRT-FR.DIC, 648314 bytes (available at http://www.ii.uni.wroc.pl/~inikep/research/wrt40-fr.rar)
8+
WRT-PL.DIC, 1479847 bytes (available at http://www.ii.uni.wroc.pl/~inikep/research/wrt40-pl.rar)
9+
WRT-RU.DIC, 4954251 bytes (available at http://www.ii.uni.wroc.pl/~inikep/research/wrt40-ru.rar)
10+
11+
2. You can easily add your own dictionaries. Structure of a dictionary is following:
12+
1st line - a number of words in a dictionary (used only to select codewords)
13+
2nd line - original charset, capital letters (for national letters in multi-lingual files)
14+
3rd line - original charset, lower-case letters (for national letters in multi-lingual files)
15+
4th line - second charset, capital letters (different encoding)
16+
5th line - second charset, lower-case letters (different encoding)
17+
6th line - third charset, capital letters (different encoding)
18+
7th line - third charset, lower-case letters (different encoding)
19+
8th line - fourth charset, capital letters (different encoding)
20+
9th line - fourth charset, lower-case letters (different encoding)
21+
10th line - fifth charset, capital letters (different encoding)
22+
11th line - fifth charset, lower-case letters (different encoding)
23+
12th line - sixth charset, capital letters (different encoding)
24+
13th line - sixth charset, lower-case letters (different encoding)
25+
14th line - first word in a dictionary
26+
15th line - second word in a dictionary
27+
16th line - and so on
28+
29+
Example (WRT-PL.DIC):
30+
191796
31+
��ʣ�ӌ�� - original charset (Windows Latin2/cp1250)
32+
����󜿟
33+
��ʣ�Ӧ�� - second charset (ISO-8859-2)
34+
����󶿼
35+
��������� - third charset (DOS Latin2/cp852)
36+
����䢘��
37+
��������� - fourth charset (DOS Mazovia)
38+
���������
39+
��������� - fifth charset (AmigaPL)
40+
���������
41+
ACELNOSZZ - sixth charset (no polish letters)
42+
acelnoszz
43+
na - first word in a dictionary
44+
do - second word in a dictionary

paq8b/TextFilter/wrt-short-c++.dic

+100
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,100 @@
1+
Short dictionary "c++" by Przemyslaw Skibinski for WRT 4.0 (best results when max 85 words). Don't change the following 12 lines:
2+
#
3+
#
4+
#
5+
#
6+
#
7+
#
8+
#
9+
#
10+
#
11+
#
12+
#
13+
#
14+
the
15+
and
16+
for
17+
function
18+
malloc
19+
calloc
20+
free
21+
getc
22+
delete
23+
data
24+
argc
25+
static
26+
unsigned
27+
defined
28+
void
29+
end
30+
dword
31+
code
32+
mode
33+
#include
34+
message
35+
table
36+
file
37+
while
38+
#define
39+
type
40+
procedure
41+
case
42+
false
43+
else
44+
#else
45+
byte
46+
true
47+
size
48+
typedef
49+
#ifdef
50+
#ifndef
51+
#if
52+
endif
53+
#endif
54+
sizeof
55+
printf
56+
fprintf
57+
long
58+
debug
59+
length
60+
break
61+
unlink
62+
real
63+
null
64+
bool
65+
stream
66+
program
67+
then
68+
main
69+
stdin
70+
min
71+
union
72+
version
73+
return
74+
strcmp
75+
getchar
76+
purchar
77+
char
78+
register
79+
error
80+
stderr
81+
ptr
82+
class
83+
struct
84+
get
85+
set
86+
memset
87+
int
88+
uint
89+
start
90+
short
91+
list
92+
const
93+
stdout
94+
input
95+
output
96+
next
97+
argv
98+
new
99+
max
100+

0 commit comments

Comments
 (0)