I also meant to say that I am not too convinced, enthusiastic about knowledge graphs or any kind of annotations tagged onto texts, what Franzosi calls "coding" which I questioned in a rant about his book:
// __ somewhere between a draft for the story line of a movie and listening to Žižek ...
https://www.amazon.com/gp/customer-reviews/R1N1USZJW30O80/ ~ I think corpora should be self-describing from the texts themselves. Nothing more, nothing less. All kinds of functionality into corpora should be indexed into the texts themselves, not "described" as some sort of "value-added" whatever. Do you know of any work emphasizing these aspects, ways of understanding corpora? By the way I am a Mathematician/Physicist so my "metrical" way of seeing things would be influenced wrongly or not by my background lbrtchx