inflection_finder.py (+53 -30)
@@ -1,19 +1,21 @@
 #!/usr/bin/python
 # -*- coding: utf-8 -*-
-import sys
-import getopt
-import logging
-import unicodecsv
-import jsondoa
+import getopt, logging, sys
+import jsondoa, unicodecsv
+import itertools
 from collections import defaultdict
 
 """
 An early step in any decipherment process: Is the language inflected?
+
 "walked" and "walking" are inflections of "walk", as is "dogs" of "dog".
-Won't catch irregular inflections like tener 'to have' inflected as tiene '(s)he has' and tengo 'i have' in Spanish, but is an organized step in approaching an undeciphered writing system
+
+This won't catch irregular inflections like tener 'to have' inflected as tiene '(s)he has' and
+tengo 'i have' in Spanish, but this is an organized step in approaching an undeciphered
 inputs a list of potential inflections, outputs a list of potential cases
 
-For example, if we receive the list (ko-no-so, ko-no-si-jo, ko-no-si-ja, ko-no-so-de), we will receive in response (o, i-jo, i-ja, o-de) as ko-no-s is ubiquitous.
-How do we handle edge cases where the common ground is a complete set, i.e. (ko-no-so, ko-no-so-de)? should have a way to return root + -de instead of just -de
+For example, if we receive the list (ko-no-so, ko-no-si-jo, ko-no-si-ja, ko-no-so-de),
+we will receive in response (o, i-jo, i-ja, o-de) as ko-no-s is ubiquitous.
+How do we handle edge cases where the common ground is a complete set, i.e. (ko-no-so,
+ko-no-so-de)? should have a way to return root + -de instead of just -de
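
The docstring in this hunk describes stripping the shared stem from a list of candidate inflections. The following is a minimal sketch of that idea, not the code from this commit: the function name `split_common_root` is hypothetical, and it assumes the forms are plain transliterated strings compared character by character. Returning the root alongside the endings is one possible answer to the edge case raised in the docstring, since the caller can then rebuild root + ending when one form is itself the whole common stem.

```python
import os.path


def split_common_root(forms):
    """Split each form into a shared root and its own ending.

    Hypothetical helper for illustration only. The root is the longest
    character-level prefix common to every form.
    """
    forms = list(forms)
    if not forms:
        return "", []
    # os.path.commonprefix compares strings character by character,
    # so it works on arbitrary strings, not only file paths.
    root = os.path.commonprefix(forms)
    endings = [form[len(root):] for form in forms]
    return root, endings


# Example from the docstring: ko-no-s is shared by all four forms.
root, endings = split_common_root(
    ["ko-no-so", "ko-no-si-jo", "ko-no-si-ja", "ko-no-so-de"]
)
print(root)     # ko-no-s
print(endings)  # ['o', 'i-jo', 'i-ja', 'o-de']

# Edge case from the docstring: one form is the entire common stem.
# The empty ending marks the bare root, so root + "-de" stays recoverable.
edge_root, edge_endings = split_common_root(["ko-no-so", "ko-no-so-de"])
print(edge_root)     # ko-no-so
print(edge_endings)  # ['', '-de']
```

Because the comparison is per character, the root can end in the middle of a syllable sign (ko-no-s rather than ko-no), which matches the docstring's example; a sign-level split would need the forms tokenized on "-" first.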