https://emanjavacas.github.com/slides/datech17
Spread of non-reusable data
in multi-user, real-time environments
Unifying annotation schemes and their usage
Fundamental problem of collaborative software/distributed systems
“How to ensure that dispersed users are able to modify a shared body of documents without overwriting each other’s changes”
Collaborative Synchronized Corpus Annotation Tool
https://www.github.com/emanjavacas/cosycat
Real-time feedback client-to-client following a Pub/Sub architecture
Subscribers are assigned roles & permissions (resource access control)
Conflicts are explicitely resolved and documented using threads
Support for Token & Span annotations
{ "ann" : { "key" : "pret", "value" : "s" }, "corpus" : "mbg-index", "query" : "'gat'", "span" : { "type" : "token", "scope" : 133601, "doc" : "A01342" }, "timestamp" : 1476095003208, "username" : "OscarStrik", "version" : 0 }
{ "ann" : { "key" : "cxnele", "value" : "getobj" }, "corpus" : "mbg-index", "query" : "'gat'", "span" : { "type" : "IOB", "scope" : { "B" : 3222, "O" : 3224 }, "doc" : "99863288" }, "timestamp" : 1476095439636, "username" : "OscarStrik", "version" : 0 }
Version-controlled: Each annotation has a revision history