Refactor statusapiv1 to trait and implement for ease of creation of these objects when we implement our own parser #248
Conversation
…or uptime is allocated resources and task time is used resources
…efactor
# Conflicts:
#	app/com/linkedin/drelephant/spark/fetchers/SparkFetcher.scala
…hese objects when we implement our own parser
…ndle dynamic allocation" This reverts commit 9d550f4.
Hi @shankar37, I think the PR is missing a commit renaming the API classes to *Impl. The PR also seems to be missing a bit of context. Do you plan to make the JSON parsing streaming?
@shankar37 From this PR it is difficult to understand the motivation for the change, and therefore it is difficult to review. Can you please share the design of the event log parsing code?
Here is, hopefully, the context. Currently, event log parsing happens in two ways. For the SparkFetcher, it streams the JSON event log and looks only for the EnvironmentUpdate event. This code is in SparkLogClient.scala; it does not use Spark's ReplayBus and relies only on Spark's public APIs. In addition, the SparkFetcher uses the SparkRestClient to get the rest of the data from the REST APIs and deserializes it directly into the statusapiv1 objects. The other way is in SparkFSFetcher, which uses the ReplayBus and Spark listeners to parse the event log, and then uses the LegacyDataConverter to convert the data it read into statusapiv1.

There are a couple of problems with this. First, it doesn't parse all the data from the event log, and some of the data we might require is not present. Second, in light of SPARK-18085 and this commit (apache/spark@561e9cc), the ReplayBus and these listeners are deprecated. Hence, we need to change the event log parsing for the FS fetcher to use our own parser.

What I am planning to do is write a generic EventLogParser that takes a stream and parses the event log JSON. It will parse it the way SparkLogClient does, but for all events we care about, and will produce the statusapiv1 data in its entirety. Both SparkFSFetcher and SparkLogClient will then be made to use it. It will take boolean flags indicating which parts of the data need to be parsed, to avoid parsing data the client doesn't need. To do that, the EventLogParser needs to read one event at a time, store some intermediate data, and convert it into the statusapiv1 structures. I am trying to avoid creating a lot of intermediate data and doing a field-by-field copy into statusapiv1, as the LegacyDataConverter does. Instead, I want to create the statusapiv1 Impl objects, fill in and modify the data as I parse and calculate it, and then just return them as the statusapiv1 traits.

This PR is the first step towards that. It makes SparkApplicationData contain only traits, so readers (heuristics and metricsAggregator) continue to get read-only data, while writers can create the Impl objects and write to individual fields. The problem with a class that has only vals is that it can only be created once all the fields are available, and that is difficult to do when you are parsing event logs one line at a time.
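A minimal sketch of that shape, assuming hypothetical names (EventLogParser, ApplicationInfo, and ApplicationInfoImpl are trimmed-down stand-ins for the real statusapiv1 types) and json4s for JSON parsing; the event field names follow what Spark's JsonProtocol writes to the event log:

```scala
import java.io.{BufferedReader, InputStream, InputStreamReader}

import org.json4s._
import org.json4s.jackson.JsonMethods.{parse => parseJson}

// Hypothetical, trimmed-down versions of the statusapiv1 types.
trait ApplicationInfo {
  def name: String
  def startTime: Long
  def endTime: Long
}

// Mutable implementation that the parser fills in incrementally.
class ApplicationInfoImpl(
  var name: String = "",
  var startTime: Long = -1L,
  var endTime: Long = -1L
) extends ApplicationInfo

class EventLogParser {
  private implicit val formats: Formats = DefaultFormats

  // Reads the event log one JSON line at a time, updates the Impl object
  // for the events of interest, and returns it typed as the read-only trait.
  def parse(in: InputStream): ApplicationInfo = {
    val appInfo = new ApplicationInfoImpl()
    val reader = new BufferedReader(new InputStreamReader(in))
    Iterator.continually(reader.readLine()).takeWhile(_ != null).foreach { line =>
      val json = parseJson(line)
      (json \ "Event").extract[String] match {
        case "SparkListenerApplicationStart" =>
          appInfo.name = (json \ "App Name").extract[String]
          appInfo.startTime = (json \ "Timestamp").extract[Long]
        case "SparkListenerApplicationEnd" =>
          appInfo.endTime = (json \ "Timestamp").extract[Long]
        case _ => // skip events the caller did not ask for
      }
    }
    appInfo
  }
}
```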
Thanks @shankar37 for providing this detail.
I am planning to move that code into the new event log parser and expand on it. I will continue to rely on SparkListenerEvent and its derived classes, as I didn't find any good way to remove that dependency. Do you have any ideas on how to avoid depending on it?
I don't have any ideas either on how to remove these dependencies. Maybe we should think about it when you submit the PR. As of now, I think the goal should be to have the minimum dependency on Spark.
This change LGTM.
- val numCompletedStages: Int,
- val numSkippedStages: Int,
- val numFailedStages: Int)
+trait ApplicationInfo {
@shankar37 Why do we have these traits? Why can't we just have simple classes with var arguments?
These traits are returned as part of the HadoopApplicationData interface. If we just used classes with vars, readers could potentially change the values, and the data would no longer be read-only. So, to keep the interface clean, we need the traits.
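A minimal sketch of that read-only contract, with hypothetical, trimmed-down fields: the writer holds the Impl and mutates it freely, while readers receive only the trait, which exposes getters alone, so writes through it do not compile.

```scala
// Sketch only; the real statusapiv1 traits carry many more fields.
trait JobData {
  def numCompletedStages: Int
  def numFailedStages: Int
}

class JobDataImpl(
  var numCompletedStages: Int = 0,
  var numFailedStages: Int = 0
) extends JobData

object ReadOnlyDemo extends App {
  val impl = new JobDataImpl()
  impl.numCompletedStages = 42      // the parser (writer) mutates freely

  val view: JobData = impl          // heuristics receive only the trait
  println(view.numCompletedStages)  // prints 42
  // view.numCompletedStages = 0    // does not compile: no setter on the trait
}
```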
Shekhar,
I am planning to merge this so that the next PR is a clean diff of the parsing changes alone. Let me know by EOD if you have any further comments.
Thanks,
Shankar
@shankar37 LGTM.
…hese objects when we implement our own parser (linkedin#248)
This changes the statusapiv1 classes into traits and creates Impl classes for them. This change in itself seems to achieve nothing, but it is in preparation for rewriting the event log parsing code to use our own custom listeners and to create the objects from them. The previous classes, with only vals, were hard to create because all the data had to be available before the objects could be constructed. The new Impl classes have var members, which make it easy to set data incrementally.
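A compact, hypothetical illustration of the construction problem described here: an all-val class must be built in one shot, while the var-based Impl can be created empty and populated as events are replayed.

```scala
// Old shape: every value must be known before the object can exist.
class StageDataVal(val numCompletedTasks: Int, val numFailedTasks: Int)

// New shape: a read-only trait plus a mutable Impl.
trait StageData {
  def numCompletedTasks: Int
  def numFailedTasks: Int
}

class StageDataImpl(
  var numCompletedTasks: Int = 0,
  var numFailedTasks: Int = 0
) extends StageData

object ConstructionDemo extends App {
  val stage = new StageDataImpl()   // created before any task event is seen
  stage.numCompletedTasks += 1      // updated per parsed task-end event
  stage.numFailedTasks += 1         // updated when a failure event appears
  val readOnly: StageData = stage   // handed to heuristics as the trait
  println(s"${readOnly.numCompletedTasks} completed, ${readOnly.numFailedTasks} failed")
}
```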