Sai: Lowercase converter in Python by b-sai · Pull Request #190 · kscanne/5030

b-sai · 2023-02-16T15:33:57Z

No description provided.

austin-carnahan

Looks good! I had a hard time finding anything to improve on!

austin-carnahan · 2023-02-16T19:07:11Z

S23/b-sai/main.py

+            if letter == 'I':
+                lower_letter = 'ı'
+        elif language.startswith(('gd', 'gv', 'ga')):
+            if idx == 1 and (letter in ['A', 'E', 'I', 'O', 'U', 'Á', 'É', 'Í', 'Ó', 'Ú', "Ó"] or ord(letter) in [211]) and word[0] in ['n', 't'] and (len(word)-idx >= 2 and ord(word[idx+1]) != 771):


I think this conditional statement is a little long - it took me a minute to tease out what it's saying. Consider breaking it up on multiple lines or adding some inline documentation to explain.

austin-carnahan · 2023-02-16T19:11:16Z

S23/b-sai/main.py

+        elif language.startswith('el'):
+            if letter == 'Σ' and idx == len(word)-1:
+                lower_letter = 'ς'
+        elif language.startswith(("zh", "th", "ja")):


It's not going to save a ton of time, but if you aren't making any changes to the letters in these languages -- you don't need to loop through each character. Consider moving this conditional to the top of your loop and exiting early.

kscanne · 2023-02-23T01:45:13Z

S23/b-sai/main.py

+    """
+    result = ""
+
+    if language.startswith(("zh", "th", "ja")):


There are 3-letter language code permitted in BCP-47, so "startswith" won't work here, e.g. "jam" is "Jamaican Creole English".

kscanne · 2023-02-23T01:45:47Z

S23/b-sai/main.py

+    result = ""
+
+    if language.startswith(("zh", "th", "ja")):
+        return word.lower()


And the point was in these cases to not bother calling lower() as an optimization.

kscanne · 2023-02-23T01:46:25Z

S23/b-sai/main.py

+        if language == 'tr' or language == 'az':
+            if letter == 'I':
+                lower_letter = 'ı'
+        elif language.startswith(('gd', 'gv', 'ga')):


Same issue as above; this won't work.

kscanne · 2023-02-23T01:47:18Z

S23/b-sai/main.py

+            is_2nd_letter = idx == 1
+            is_exception_letter = letter in [
+                'A', 'E', 'I', 'O', 'U', 'Á', 'É', 'Í', 'Ó', 'Ú', "Ó"]
+            is_letter_o_latin = ord(letter) in [211]


Magic numbers, here and 771 below! Unreadable.

kscanne · 2023-02-23T01:47:52Z

S23/b-sai/main.py

+    word, language, actual = test.split("\t")
+    predicted = to_lowercase(word, language)
+    if predicted != actual:
+        print(f"COuldn't convert {word} in {language}!")


Small typo.

kscanne · 2023-02-23T01:48:37Z

S23/b-sai/main.py

+            is_letter_o_latin = ord(letter) in [211]
+            is_beginning_exception = word[0] in ['n', 't']
+            is_not_last = len(word)-idx > 1
+            if is_2nd_letter and (is_exception_letter or is_letter_o_latin) and is_beginning_exception and (is_not_last and ord(word[idx+1]) != 771):


The Unicode business needs a bit more work; will discuss in class.

b-sai added 4 commits February 12, 2023 11:35

initial implementation complete

341c74c

moved file to right repo

84c113a

bug fix

3aefce8

adding converter

702f028

austin-carnahan reviewed Feb 16, 2023

View reviewed changes

b-sai added 2 commits February 22, 2023 15:22

Merge remote-tracking branch 'upstream/master' into lowercase

5c4a138

fix issues

ea00c69

kscanne reviewed Feb 23, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sai: Lowercase converter in Python#190

Sai: Lowercase converter in Python#190
b-sai wants to merge 6 commits intokscanne:masterfrom
b-sai:lowercase

b-sai commented Feb 16, 2023

Uh oh!

austin-carnahan left a comment

Uh oh!

austin-carnahan Feb 16, 2023

Uh oh!

austin-carnahan Feb 16, 2023

Uh oh!

kscanne Feb 23, 2023

Uh oh!

kscanne Feb 23, 2023

Uh oh!

kscanne Feb 23, 2023

Uh oh!

kscanne Feb 23, 2023

Uh oh!

kscanne Feb 23, 2023

Uh oh!

kscanne Feb 23, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

b-sai commented Feb 16, 2023

Uh oh!

austin-carnahan left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants