
Getting the best language match from predictions #6

Open
goodmami opened this issue Jul 6, 2017 · 4 comments

Comments

@goodmami
Member

goodmami commented Jul 6, 2017

Currently (when the code works), it only returns the True/False prediction and its score (as model.Distribution objects). More than one of the languages, or none of them, may be chosen as True. Instead, the prediction scores should be used to rank the candidate languages for a span, and the top-ranked language should be used as the final prediction.

@MackieBlackburn
Collaborator

Should this be done by modifying the test() function in main.py?

@goodmami
Member Author

Yeah I suppose. Here's the relevant code block in the test() function:

for dist in model.test(instances):
    # currently this just dumps each Distribution's attributes and classes
    print(dir(dist))
    print(dist.classes())

You could write a function to normalize the values (e.g. set the one with the highest confidence of a False value to 0, the one with the highest confidence of a True value to 1, and scale everything else accordingly). Then replace the code block above with something like:

ranked_list = normalize_probabilities(model.test(instances))
if len(ranked_list) != 0:
    top = ranked_list[0]
    ...
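
For concreteness, here is a minimal sketch of what such a helper could look like. It assumes each prediction can be reduced to a (language, score) pair, where the score is the model's confidence that the language is present; the name normalize_probabilities and that pairing are assumptions for illustration, not the existing models.py API:

def normalize_probabilities(predictions):
    """Rescale scores to [0, 1] and return (language, score) pairs ranked best-first.

    `predictions` is assumed to be an iterable of (language, score) pairs;
    the real Distribution objects would need to be unpacked into that form first.
    """
    pairs = list(predictions)
    if not pairs:
        return []
    scores = [score for _, score in pairs]
    low, high = min(scores), max(scores)
    span = (high - low) or 1.0  # avoid dividing by zero when all scores are equal
    ranked = [(lang, (score - low) / span) for lang, score in pairs]
    ranked.sort(key=lambda pair: pair[1], reverse=True)
    return ranked

With something along those lines, ranked_list[0] would give the single best language for the span, which is what the final prediction needs.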

@MackieBlackburn
Collaborator

MackieBlackburn commented Jul 31, 2017

Reviewing the code, it looks like the model.test() function returns Distribution objects, each of which contains a dictionary mapping classes to probabilities. Each Distribution object also has a best_class field, so if I'm not mistaken this issue might be solved by doing:

for dist in model.test(instances):
    print(dir(dist))
    top = dist.best_class

I can put some normalization code into the Distribution class to make sure the probabilities are normalized.
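
As a rough illustration of that idea (a sketch only; the real Distribution class in models.py may store its probabilities differently, so the attribute names here are assumptions):

class Distribution:
    """Simplified stand-in for model.Distribution, used only to show normalization."""

    def __init__(self, class_probs):
        # class_probs: dict mapping each class (e.g. True/False) to a raw score
        self.class_probs = dict(class_probs)

    def normalize(self):
        """Rescale the stored scores in place so they sum to 1."""
        total = sum(self.class_probs.values())
        if total > 0:
            self.class_probs = {c: p / total for c, p in self.class_probs.items()}

    @property
    def best_class(self):
        """Return the class with the highest (normalized) probability."""
        return max(self.class_probs, key=self.class_probs.get)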

@goodmami
Member Author

Hmm, possibly. I didn't write models.py, but I thought it returned a distribution for each language, and that the classes were True and False. If so, a language with a high probability for False would simply get best_class == False for its own distribution, which doesn't tell you which language is the best match across the whole set.

I could be wrong though.
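
To make that concern concrete, here is a hypothetical set of per-language distributions for one span (the values are invented for illustration only):

# Hypothetical per-language class probabilities for a single span.
predictions = {
    'eng': {True: 0.35, False: 0.65},  # best_class would be False
    'spa': {True: 0.20, False: 0.80},  # best_class would be False
    'fra': {True: 0.10, False: 0.90},  # best_class would be False
}

# Ranking by P(True) still picks a single best language even though
# every distribution's best_class is False.
best_language = max(predictions, key=lambda lang: predictions[lang][True])
print(best_language)  # -> 'eng'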
