Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixes: #4592; Flesh Out Apache Spark Examples Documentation #4631

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

himanshumahajan138
Copy link
Contributor

Pull Request

Flesh Out Apache Spark Examples Documentation
Fixes: #4592

Description

Scala Spark Examples 4-rdd-spark, 5-sql-analytics, 6-spark-streaming, 7-spark-mllib, 8-spark-graphx

Related Issues

Checklist

  • 4-rdd-spark
  • 5-sql-analytics
  • 6-spark-streaming
  • 7-spark-mllib
  • 8-spark-graphx
  • Updated Documentation
  • Update Previous Examples

Status

Examples Added Documentation Update Required & Require Example Reviews

@himanshumahajan138
Copy link
Contributor Author

@lihaoyi Good Wishes Sir!

Could u please review the examples once and if satisfied then i will move to documentation

Few Things i want to Mention:

  1. We have one example of spark-submit so for safe apart from duplication i haven't used spark submit in these examples, Will use if you want or say
  2. I have Added almost all learning aspects from beginner to advance topics as per official spark docs
  3. Documentation will be updated upon satisfaction

If Want any changes or Requirement please once let me know

Thanks !!!

@lihaoyi
Copy link
Member

lihaoyi commented Mar 1, 2025

@himanshumahajan138 this is not what I want. This PR contains different code with the same build config over and over with zero explanation. The ticket is to show off use cases which require different build configs, with the minimal code necessary to exercise the build, with clear explanations of why those build configs are the way they are

@himanshumahajan138
Copy link
Contributor Author

@lihaoyi i understood that we need to have explanation and different build configs but what about the core concepts like the examples tittle are they the ones expected (RDD, Streaming, MLlib...)

Please confirm this once

@lihaoyi
Copy link
Member

lihaoyi commented Mar 1, 2025

The important thing here is the interactions between Spark and the build tool Mill. We can assume that anyone who is reading this knows how Spark works.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Flesh out Apache Spark Examples documentation (500USD Bounty)
2 participants