Skip to content

Commit

Permalink
docs: update chat_knowledge.md and wechat.jpg (#1035)
Browse files Browse the repository at this point in the history
  • Loading branch information
Aries-ckt authored Jan 5, 2024
1 parent 186b6a5 commit 5bc4196
Show file tree
Hide file tree
Showing 3 changed files with 18 additions and 1 deletion.
Binary file modified assets/wechat.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
17 changes: 17 additions & 0 deletions docs/docs/application/started_tutorial/chat_knowledge.md
Original file line number Diff line number Diff line change
Expand Up @@ -56,6 +56,23 @@ and click Process, it will take a few minutes to complete the document segmentat
<img src={'/img/chat_knowledge/doc_segmentation.png'} width="720px" />
</p>

:::tip
**Automatic: The document is automatically segmented according to the document type.**

**Chunk size: The number of words in each segment of the document. The default is 512 words.**
- chunk size: The number of words in each segment of the document. The default is 512 words.
- chunk overlap: The number of words overlapped between each segment of the document. The default is 50 words.
** Separator:segmentation by separator **
- separator: The separator of the document. The default is `\n`.
- enable_merge: Whether to merge the separator chunks according to chunk_size after splits. The default is `False`.
** Page: page segmentation, only support .pdf and .pptx document.**

** Paragraph: paragraph segmentation, only support .docx document.**
- separator: The paragraph separator of the document. The default is `\n`.

** Markdown header: markdown header segmentation, only support .md document.**
:::


### Waiting for document vectorization

Expand Down
2 changes: 1 addition & 1 deletion setup.py
Original file line number Diff line number Diff line change
Expand Up @@ -638,7 +638,7 @@ def init_install_requires():
setuptools.setup(
name="db-gpt",
packages=find_packages(exclude=("tests", "*.tests", "*.tests.*", "examples")),
version="0.4.4",
version="0.4.5",
author="csunny",
author_email="[email protected]",
description="DB-GPT is an experimental open-source project that uses localized GPT large models to interact with your data and environment."
Expand Down

0 comments on commit 5bc4196

Please sign in to comment.