Commit 2ae5e45

Update text_splitter to 4000 chunk_size and 200 overlap
1 parent c78fe49 commit 2ae5e45

1 file changed

Lines changed: 1 addition & 1 deletion

rabbithole/loader.py
@@ -33,7 +33,7 @@ def load_file(file: UploadedFile) -> list[Document]:
     Supported file types: PDF
     :return: List of Document objects
     """
-    text_splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=0)
+    text_splitter = CharacterTextSplitter(chunk_size=4000, chunk_overlap=200)
 
     # Handle .docx files
     if file.name.endswith(".docx"):
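
To illustrate what the two changed parameters control, here is a minimal sketch of fixed-size character chunking with overlap. It is not LangChain's implementation: `CharacterTextSplitter` splits on a separator and merges the pieces, so its real chunk boundaries will differ. The helper name `chunk_with_overlap` and the sample text are hypothetical, chosen only to compare the old settings (1000/0) with the new ones (4000/200).

```python
def chunk_with_overlap(text: str, chunk_size: int, chunk_overlap: int) -> list[str]:
    """Split text into character windows of at most chunk_size characters.

    Consecutive windows share chunk_overlap characters. Simplified
    illustration only (hypothetical helper, not LangChain's algorithm).
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must be in [0, chunk_size)")

    step = chunk_size - chunk_overlap  # how far each window advances
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # this window already reached the end of the text
    return chunks


# Hypothetical 10,000-character input for comparison.
sample = "".join(chr(97 + i % 26) for i in range(10_000))

old_chunks = chunk_with_overlap(sample, chunk_size=1000, chunk_overlap=0)
new_chunks = chunk_with_overlap(sample, chunk_size=4000, chunk_overlap=200)

print(len(old_chunks))  # 10
print(len(new_chunks))  # 3
```

Under these assumptions, the new settings yield fewer, larger chunks, and each chunk repeats the last 200 characters of the previous one, so text near a boundary appears in both neighbours.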

0 commit comments