Issue with diacritics in S3 “File key prefix” (UTF-8 characters encoded)

Hi,
we’re using the Amazon S3 delivery in Docupilot, and we noticed that when the File key prefix contains characters with diacritics (for example: Kvalitativní list), the resulting object key in S3 gets percent-encoded (e.g. Kvalitativn%C3%AD%20list).

However, when we upload files directly to the same S3 bucket using our own scripts, UTF-8 characters appear correctly in S3 (no encoding).

Could you please confirm:

  1. Does Docupilot automatically URL-encode all S3 object keys?

  2. Is there a way to preserve UTF-8 characters (č, ř, í, etc.) in S3 key names?

  3. Or is it recommended to remove diacritics before passing the File key prefix?

Thanks a lot for clarifying!

David

1 Like

Hi @david this issue is acknowledged by our engineering team and are investigating further to understand the best way to address this.

Hi @david,

The issue has been fixed — Docupilot now preserves UTF-8 characters (like č, ř, í, etc.) in S3 object keys without percent-encoding.

Thanks for reporting this and helping us improve!

1 Like

Hi @rohit

Thanks for quickly fix.

David

3 Likes