You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is there a way to make wordninja aware of capital letters and be able to cut at caps boundaries if not informed by the dictionary? Or is there a way to build a caps aware dictionary?
for example: rCBVmeanSD should be cut as rCBVmeanSD. But of course my domain specific language model must both define mean and means and the resulting cut is rCBV', 'meanS', 'D'.
Thoughts?
The text was updated successfully, but these errors were encountered:
Yeah I'm pretty sure the existing model and code is case blind. I'd be
open to a PR and a language model trained on case. The statistical model
should be equivalent just the language would be larger.
On Tue, May 26, 2020, 6:31 PM Paul Perry ***@***.***> wrote:
Is there a way to make wordninja aware of capital letters and be able to
cut at caps boundaries if not informed by the dictionary? Or is there a way
to build a caps aware dictionary?
for example: rCBVmeanSD should be cut as rCBV mean SD. But of course my
domain specific language model must both define mean and means and the
resulting cut is rCBV', 'meanS', 'D'.
Thoughts?
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#14>, or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAPUNAOK53WIUENXYF24EPDRTRUPJANCNFSM4NLRDHJQ>
.
Is there a way to make wordninja aware of capital letters and be able to cut at caps boundaries if not informed by the dictionary? Or is there a way to build a caps aware dictionary?
for example:
rCBVmeanSD
should be cut asrCBV
mean
SD
. But of course my domain specific language model must both definemean
andmeans
and the resulting cut isrCBV
', 'meanS
', 'D
'.Thoughts?
The text was updated successfully, but these errors were encountered: