Replies: 4 comments 8 replies
-
you can try changing the separator to split on something else instead of line breaks or using a better splitter than this which can be like this one |
Beta Was this translation helpful? Give feedback.
-
Hi @Hamas-ur-Rehman, |
Beta Was this translation helpful? Give feedback.
-
You could use TokenTextSplitter instead of CharacterTextSplitter, which could accurately split text based on the chunk_size, with each chunk having the same token |
Beta Was this translation helpful? Give feedback.
-
If Is there any way to stop this message from printing Created a chunk of size n, which is longer than the specified x, something like verbose = False somewhere? |
Beta Was this translation helpful? Give feedback.
-
I am not sure this is the right forum to ask a question.
I have a very large document. I get the following message when I split the text.
Created a chunk of size 1870, which is longer than the specified 1000
text_splitter = CharacterTextSplitter(
separator="\n",
chunk_size=1000,
chunk_overlap=200,
length_function=len
)
texts = text_splitter.split_text(raw_text)
Beta Was this translation helpful? Give feedback.
All reactions