-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Introduce Deltalake #11
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @mrocklin I'll try this out and try to get it running on the cloud as well
generate_data.py
Outdated
else: | ||
path = pathlib.Path(path) | ||
path.mkdir(parents=True, exist_ok=True) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hmm I'm curious why this is needed given the mkdir
calls later in this function, but I'll probably find out while running this code locally. Feel free to fill me in if you recall why this was added
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This might just be a merge issue.
@mrocklin I'm stepping away for a bit but will circle back here. Just pushed commits that resolve merge conflicts and make this run on S3 (and local still) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @mrocklin
This doesn't work yet. I'm getting odd column errors in the tpch query. Sharing early in case people have thoughts.