Hi David,
What is the purpose of your n-gram analysis or classification task? What software have you been using? Why not just use exact character-level string matching? (Perhaps take the log if length issues get out of hand?)
Best,
Ada
On Mon, Jun 26, 2023 at 1:45 PM David Beauchamp via Corpora <corpora@list.elra.info> wrote:
Thanks for the suggestion, will look into it.
_______________________________________________
Corpora mailing list -- corpora@list.elra.info
https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
To unsubscribe send an email to corpora-leave@list.elra.info