https://emanjavacas.github.com/slides/datech17
Spread of non-reusable data
in multi-user, real-time environments
Unifying annotation schemes and their usage
Fundamental problem of collaborative software/distributed systems
“How to ensure that dispersed users are able to modify a shared body of documents without overwriting each other’s changes”
Collaborative Synchronized Corpus Annotation Tool
https://www.github.com/emanjavacas/cosycat
Real-time feedback client-to-client following a Pub/Sub architecture
Subscribers are assigned roles & permissions (resource access control)
Conflicts are explicitly resolved and documented using threads
Support for Token & Span annotations
{
  "ann" : {
    "key" : "pret",
    "value" : "s"
  },
  "corpus" : "mbg-index",
  "query" : "'gat'",
  "span" : {
    "type" : "token",
    "scope" : 133601,
    "doc" : "A01342"
  },
  "timestamp" : 1476095003208,
  "username" : "OscarStrik",
  "version" : 0
}
{
  "ann" : {
    "key" : "cxnele",
    "value" : "getobj"
  },
  "corpus" : "mbg-index",
  "query" : "'gat'",
  "span" : {
    "type" : "IOB",
    "scope" : {
      "B" : 3222,
      "O" : 3224
    },
    "doc" : "99863288"
  },
  "timestamp" : 1476095439636,
  "username" : "OscarStrik",
  "version" : 0
}
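A minimal Python sketch of how a client might read the two span types shown above. The helper name `covered_tokens` is hypothetical and not part of cosycat. It assumes that `"B"` and `"O"` in an IOB scope mark the first and last token of the span, both inclusive; the slides do not state this.

```python
from typing import List

# The two example annotations from the slides, reduced to the fields used here.
token_ann = {"span": {"type": "token", "scope": 133601, "doc": "A01342"}}
span_ann = {"span": {"type": "IOB", "scope": {"B": 3222, "O": 3224}, "doc": "99863288"}}


def covered_tokens(annotation: dict) -> List[int]:
    """Return the token ids an annotation covers (hypothetical helper)."""
    span = annotation["span"]
    if span["type"] == "token":
        # A token annotation points at a single token id.
        return [span["scope"]]
    if span["type"] == "IOB":
        # ASSUMPTION: "B" is the first and "O" the last token, inclusive.
        begin, end = span["scope"]["B"], span["scope"]["O"]
        return list(range(begin, end + 1))
    raise ValueError(f"unknown span type: {span['type']!r}")
```

Under that assumption, the token annotation covers one token and the IOB annotation covers tokens 3222 through 3224.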
Version-controlled: Each annotation has a revision history
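One way a per-annotation revision history could work, sketched in Python. This is an illustration, not cosycat's implementation: the `history` field, the `update_annotation` function, and the `StaleVersionError` check are all assumptions. Only `ann`, `timestamp`, `username`, and `version` come from the examples above.

```python
import copy


class StaleVersionError(Exception):
    """Raised when an update is based on an outdated version (hypothetical)."""


def update_annotation(ann, new_value, username, timestamp, expected_version):
    """Return a new annotation revision, keeping the old state in "history"."""
    # ASSUMPTION: an update must name the version it was based on, so that
    # two users editing the same annotation cannot silently overwrite each other.
    if ann["version"] != expected_version:
        raise StaleVersionError(
            f"expected version {expected_version}, found {ann['version']}"
        )
    # Snapshot the current state without its own history.
    previous = {k: copy.deepcopy(v) for k, v in ann.items() if k != "history"}
    updated = copy.deepcopy(ann)
    # ASSUMPTION: revisions are stored oldest-first in a "history" list.
    updated.setdefault("history", []).append(previous)
    updated["ann"]["value"] = new_value
    updated["username"] = username
    updated["timestamp"] = timestamp
    updated["version"] = ann["version"] + 1
    return updated
```

In this sketch a rejected update is the point where the two users would move to a discussion thread, rather than the later write winning.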