You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It gives output: ['date_partition_col=2023-11-05%2000%253A00%253A00/myfile.parquet']
Notice that the output is url encoded form of my actual file.
Thus, when reading these files I am getting 404 error:
HTTP status client error (404 Not Found) for url (https://s3.ap-south-1.amazonaws.com/my_bucket/my_delta_lake/date_partition_col=2023-11-05%2000%253A00%253A00/myfile.parquet)
What you expected to happen:
Not re-encode already encoded files.
How to reproduce it:
List/Read files from delta lake that have URL encoded partitions
More details:
I see the fix for this issue for filesystem was already included in v0.10.2 release, I am facing exact same issue even though I am using v0.13.0 (Python)
Thus had to create a new issue.
The text was updated successfully, but these errors were encountered:
Environment
Delta-rs version: 0.13.0
Binding: Python
Environment: Local and K8s Pod both
Bug
What happened:
date_partition_col=2023-11-05 00%3A00%3A00/myfile.parquet
Notice that Spark automatically saves it in a URL encoded format.
It gives output:
['date_partition_col=2023-11-05%2000%253A00%253A00/myfile.parquet']
Notice that the output is url encoded form of my actual file.
Thus, when reading these files I am getting 404 error:
HTTP status client error (404 Not Found) for url (https://s3.ap-south-1.amazonaws.com/my_bucket/my_delta_lake/date_partition_col=2023-11-05%2000%253A00%253A00/myfile.parquet)
What you expected to happen:
How to reproduce it:
More details:
I see the fix for this issue for filesystem was already included in v0.10.2 release, I am facing exact same issue even though I am using v0.13.0 (Python)
Thus had to create a new issue.
The text was updated successfully, but these errors were encountered: