|
2 | 2 | <img src="https://github.com/xetdata/pyxet/blob/0c7608c97f6a2a0cb2c83dd38fb717913c4d7522/docs/images/logo.png" alt="logo" width="400" />
|
3 | 3 | </p>
|
4 | 4 |
|
5 |
| -# Welcome to pyxet's documentation! |
| 5 | + |
| 6 | +# pyxet - The SDK for XetHub |
| 7 | + |
| 8 | +[](https://pypi.python.org/pypi/pyxet/) |
| 9 | +[](https://pypi.python.org/pypi/pyxet/) |
| 10 | +[](https://github.com/xetdata/pyxet/blob/main/LICENSE) |
| 11 | +[](https://pypi.python.org/pypi/pyxet/) |
| 12 | +[](https://discord.gg/KCzmjDaDdC) |
6 | 13 |
|
7 | 14 | pyxet is a Python library that provides a lightweight interface for the [XetHub](https://xethub.com/) platform.
|
8 | 15 | XetHub is a blob-store with a filesystem like interface and git capabilities, therefore pyxet implement both.
|
9 | 16 |
|
10 | 17 | ## Features
|
11 | 18 |
|
12 |
| -1. A filesystem interface. |
| 19 | +1. A filesystem interface: |
13 | 20 | * [fsspec](https://filesystem-spec.readthedocs.io)
|
14 | 21 | * copy
|
15 | 22 | * remove
|
16 | 23 | * list
|
17 | 24 | * etc.
|
18 | 25 | * [glob](https://docs.python.org/3/library/glob.html)
|
19 | 26 | * [pathlib.Path](https://docs.python.org/3/library/pathlib.html)(WIP)
|
20 |
| -2. Mount. |
| 27 | + |
| 28 | +2. Mount: |
21 | 29 | * Read-only optimize for speed; perfect for data exploration and analysis and building data-apps and model
|
22 | 30 | inference.
|
23 |
| - * Read-write for data ingestion and preparation; optimal for database backups and training and monitoring logs. |
24 |
| -3. Git capabilities: |
25 |
| - * add, commit, push |
26 |
| - * clone, fork |
27 |
| - * merge, rebase |
28 |
| - * pull, fetch |
29 |
| - * checkout, reset |
30 |
| - * stash, diff, log |
31 |
| - * status, branch |
32 |
| - * submodules |
33 |
| - ... |
34 |
| -4. Integrations: |
| 31 | + * Read-write for data ingestion and preparation; optimal for database backups and training and monitoring logs. _(coming soon)_ |
| 32 | + |
| 33 | +3. Integrations: |
35 | 34 | - [x] [GitHub](https://github.com) [submodule](https://git-scm.com/book/en/v2/Git-Tools-Submodules)
|
36 | 35 | - [x] [pandas](https://pandas.pydata.org)
|
37 | 36 | - [x] [polars](https://pola-rs.github.io/polars-book/)
|
38 | 37 | - [x] [pyarrow](https://arrow.apache.org/docs/python/)
|
39 | 38 | - [ ] [duckdb](https://duckdb.org/)
|
40 | 39 | - [ ] [dask](https://dask.org/)
|
41 | 40 | - [ ] [ray](https://ray.io/)
|
42 |
| -5. CLI: All the features are available through the CLI too under `xet <command>`. |
43 | 41 |
|
44 |
| -For API documentation and full examples, please see the [documentation](TODO). |
| 42 | +For API documentation and full examples, please see the [documentation](https://pyxet.readthedocs.io/en/latest/). |
| 43 | + |
| 44 | +## Getting Started |
| 45 | + |
| 46 | + |
45 | 47 |
|
46 | 48 | ## Installation
|
47 | 49 |
|
@@ -219,4 +221,4 @@ lazy_df = pl.scan_parquet(ds.dataset("file.parquet",
|
219 | 221 |
|
220 | 222 | # Project examples
|
221 | 223 |
|
222 |
| -* [Titanic-app](https://xethub.com/xdssio/titanic-server-example) |
| 224 | +* [Titanic-app](https://xethub.com/xdssio/titanic-server-example) |
0 commit comments