Hi Adam,
Just to clarify, I think your examples would actually receive different structures in (basic) UD:
*I [came and [bought a book]]* - "book" should attach to "bought", making it clear that it is not the object of "came"
*I [[saw and bought] a book] *- "book" should attach to "saw" as the first element of the coordination
The second case is formally ambiguous because, per the UD tree, "book" could properly be the object of "saw and "bought" or just "saw" (but the latter would be weird given English syntax).
Another illustration of the ambiguity is with modifiers:
*I [recently sang] and [danced]*
*I recently [sang and danced]*
would be the same in basic UD, and are both valid interpretations in English.
Cheers, Nathan
On Thu, Oct 13, 2022 at 3:07 AM Adam Przepiórkowski via Corpora < corpora@list.elra.info> wrote:
Dear All,
I am looking for treebanks (of any kind; dependency, constituency, LFG, HPSG, …) with good – preferably manual – unambiguous annotation of coordinate structures, for any language.
A typical UD treebank does not have a good annotation of coordinations, because vanilla UD does not distinguish between dependents of single conjuncts, as in *I [came and [bought a book]]*, and shared dependents of conjuncts, as in *I [[saw and bought] a book]*. Enhanced UD can in principle make this distinction, but many EUD treebanks are automatically converted from vanilla UD treebanks, so this information is also often not available or not reliable. On the other hand, many constituency treebanks (including PTB) do not have explicit information about governors of coordinations (in *I bought John and Mary interesting books* the governor of *John and Mary* is *bought* and not, say, *books*), and – perhaps surprisingly – it is often not easy to guess the governor. So I am looking for treebanks that wear both kinds of information – about shared dependents and about governors – on their sleeves.
Thanks, best, Adam P. _______________________________________________ Corpora mailing list -- corpora@list.elra.info https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/ To unsubscribe send an email to corpora-leave@list.elra.info