Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Losing chunks of replies #261

Closed
jwr opened this issue Mar 19, 2024 · 15 comments
Closed

Losing chunks of replies #261

jwr opened this issue Mar 19, 2024 · 15 comments

Comments

@jwr
Copy link

jwr commented Mar 19, 2024

I only noticed this with Anthropic, but I am not 100% certain this applies to Anthropic only.

I get replies where often several characters are missing. I investigated, turned on logging and here is an example. In my output gptel session I see "dostępnośćwencji", while the log shows:

event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":" dost"}    }

event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":"ęp"}            }

event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":"ność"}           }

event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":" sek"}  }

event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":"w"}            }

event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":"enc"}               }

event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":"ji"}    }

Note the missing " sek" chunk. It's there in the log, so this would indicate that we received it, but lost it afterwards.

This happens irregularly, sometimes 1-2 times per paragraph, sometimes once every several paragraphs.

This happens at least with output to a new buffer, or to a gptel session.

@karthink
Copy link
Owner

Thanks, I guess the fix in #233 isn't quite complete. I It's a timing based bug, so I'll need to set up a simulation of the streaming and try to find the right conditions to reproduce it.

karthink added a commit that referenced this issue Mar 20, 2024
* gptel-anthropic.el (gptel-curl--parse-stream): Remove extraneous
statements and simplify the stream parser.  Addresses #261, but
the main problem is still elusive.
@karthink
Copy link
Owner

@jwr Could you try this:

Advise the stream filter as follows to capture the output (clear the scratch buffer first):

(defun gptel-filter-debug (_ output)
  (with-current-buffer (get-buffer-create "*scratch*")
    (insert output "\n-----\n")))

(advice-add 'gptel-curl--stream-filter :before #'gptel-filter-debug)

Use the Anthropic backend and reproduce the error, then paste the contents of the scratch buffer here.

Remove the advice afterwards so you can use gptel normally.

(advice-remove 'gptel-curl--stream-filter #'gptel-filter-debug)

This should help me nail down the timing to test against.

@jwr
Copy link
Author

jwr commented Mar 25, 2024

This doesn't seem to happen as often as before, but here are some examples:

Output is "By combining anch n", should be "By combining anchored n":

event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":" By"}               }

event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":" combining"}         }

event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":" anch"}           }

event: cont
-----
ent_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":"ored"}        }

event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":" n"}     }

Output is "the relev and accuracy", should be "the relevance and accuracy":

event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":" the"}   }

event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":" relev"}  }

event: conten
-----
t_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":"ance"}  }

event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":" and"}      }

event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":" accuracy"}             }

I hope this helps.

@jwr
Copy link
Author

jwr commented Mar 25, 2024

I just realized you wanted the entire *scratch* contents:

scratch.txt

@CestDiego
Copy link

I'm having this same issue only with claude models as well.

@jwr
Copy link
Author

jwr commented Mar 29, 2024

Yes, this seems to happen with Haiku, Sonnet and Opus.

karthink added a commit that referenced this issue Mar 29, 2024
gptel-anthropic.el (gptel-curl--parse-stream): When parsing
responses from Anthropic, wait for more input when the
corresponding data chunk for the event hasn't arrived yet. (#261)
@karthink
Copy link
Owner

Pushed a commit (26326c3), please check if this is fixed now.

@jwr
Copy link
Author

jwr commented Mar 30, 2024

I am no longer able to reproduce the problem 🙂🙏 Thank you!

@benthamite
Copy link
Contributor

benthamite commented Apr 19, 2024

Unfortunately, I am experiencing this same issue, despite having updated to the latest commit (#70889ad). Attached are the contents of the *scratch* buffer, after following the instructions in this comment above.

Not sure if it’s useful, but an example of missing text occurs in the sentence starting "Pero cuando encuentres algo que...".

scratch.txt

@karthink
Copy link
Owner

@benthamite thanks for the data. Do you have the text as it is inserted into your buffer? I'd like to know exactly which chunk was missed.

@benthamite
Copy link
Contributor

benthamite commented Apr 20, 2024

@karthink, yes, I saved the gptel buffer to a file, which you can find below. (The file is zipped because GitHub didn't allow me to upload the org-mode file.)

In the example above, the truncated sentence is

Pero cuando encuentres algo que teté amargando, detente.

I think the full full sentence should have been

Pero cuando encuentres algo que te esté amargando, detente.

alegremente-claude-translate.org.zip

@benthamite
Copy link
Contributor

(There are other examples. I can list them if that helps.)

@karthink
Copy link
Owner

Has this happened recently? I think we can close this otherwise.

@jwr
Copy link
Author

jwr commented Dec 22, 2024

Not the originally reported problem, but I have been getting truncated responses from Anthropic Claude recently. I haven't had the time to pursue, debug or report it, but I thought I'd let you know as a data point.

This time the behavior was different, though: responses were clearly truncated at the end. E.g. I would get 2000 characters and then the reply would end abruptly with the last 200 or so missing.

Again, just a data point, please feel free to ignore, as I am unable to pursue this right now…

@karthink
Copy link
Owner

@jwr Maybe you're hitting the token limit?

  • You can check the log to see if this is the case. (setq gptel-log-level 'info), then check the *gptel-log* buffer. In the last data chunk there should be a "stop_reason" field.
  • You can try increasing gptel-max-tokens. I think the Anthropic API caps it by default to 1024 tokens, although that's much more than 2000 characters.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants