-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Carefulcase eats words it can't generate #35
Comments
I lost my laptop three weeks ago, so it'll be a while before I can look at
this.
…On Thursday, 25 October 2018, Kevin Brubeck Unhammer < ***@***.***> wrote:
Assigned #35 <#35> to
@jimregan <https://github.com/jimregan>.
—
You are receiving this because you were assigned.
Reply to this email directly, view it on GitHub
<#35 (comment)>, or mute
the thread
<https://github.com/notifications/unsubscribe-auth/AAN4FoMJsbXjFMdMmrDhxgKbjMluOebpks5uoYc4gaJpZM4X53Y5>
.
|
ouch :(( |
I added some tests in fd6e6dc – it turns out to be problematic if we start generating But if there were no such garden paths, |
@mr-martian Do you think this is solvable? I'd love to have a solution for this (but in bilingual mode <e> <re>[a-zA-Z]+</re><p><l></l><r><s n="np"/></r></p></e>
<e> <i>med</i> <p><l></l><r><s n="pr"/></r></p></e> and get $ echo '^Med<pr>$ ^AbCd<np>$' |lt-proc -C -b nob-nno.autogen.bin
^Med<pr>/Med$ ^AbCd<np>/AbCd$ Currently, we can get either the one or the other: $ echo '^Med<pr>$ ^AbCd<np>$' |lt-proc -C tmp.bin # eats Med
AbCd
$ echo '^Med<pr>$ ^AbCd<np>$' |lt-proc -b tmp.bin # includes extra "Abcd"
^Med<pr>/Med$ ^AbCd<np>/AbCd/Abcd$
$ echo '^Med<pr>$ ^AbCd<np>$' |lt-proc -c -g tmp.bin # fails to generate Med since lemma is lowercase
#Med AbCd Possibly related to #167 |
If the dictionary has
then we get
I would like it to just fall back to "normal" generation for words it can't find exact case for, ie.
while still retaining the -C functionality for words it can find exact matches for
The text was updated successfully, but these errors were encountered: