KinDigi Social: A Mobile-centered Social Annotation Platform for the Kindai Digital Library

paper, specified "short paper"
Authorship
  1. 1. Yuta Hashimoto

    Kyoto University

  2. 2. Yasuyuki Araki

    National Diet Library

Work text
This plain text was ingested for the purpose of full-text search, not to preserve original formatting or readability. For the most complete copy, refer to the original conference program.


KinDigi Social: A Mobile-centered Social Annotation Platform for the Kindai Digital Library

Hashimoto
Yuta

Kyoto University, Japan
yhashimoto1984@gmail.com

Araki
Yasuyuki

National Diet Library, Japan
ya-araki@ndl.go.jp

2014-12-19T13:50:00Z

Paul Arthur, University of Western Sidney

Locked Bag 1797
Penrith NSW 2751
Australia
Paul Arthur

Converted from a Word document

DHConvalidator

Paper

Short Paper

social annotation
digital collections
online collaboration
open annotation

historical studies
interface and user experience design
software design and development
programming
social media
crowdsourcing
semantic web
linking and annotation
English

The Kindai Digital Library (KDL) is an online collection of out-of-copyright books published in Japan in the 19th and 20th centuries run by the National Diet Library (NDL) of Japan.
1 It offers full access to images scanned from books previously only available in the NDL. As of November 2014, the total number of volumes in the collection has grown to around 360,000. Although digitized text of the scans is not available due to the technical difficulties of applying OCR to old Japanese type fonts, the collection has been a precious resource for studies of the history of modern Japan since its start in 2003.

KinDigi Social is an online platform for social annotation of digitized books in KDL; it offers iOS and Android clients to browse books in the KDL collection and provides a set of features to create and share annotations and tags, allowing users to exchange and discuss historical knowledge and ideas concerning the books. The project is one of the experimental projects supported by the NDL Lab, an R&D section of NDL,
2 and is going to be available at http://kindigi-social.org/ by the spring of 2015. In this paper, we will discuss the aims, methods, and implementation of the KinDigi Social platform.

Aim of the Project
The past several years saw growing interest in the use of social media for humanities studies in the digital humanities world. Possibilities of online collaboration for research in the humanities have been discussed on various occasions and put into practice in a number of research projects (Siemens et al, 2012; Barr and Tonra, 2014). Our research attempt is to apply the methodologies and techniques suggested by the results of those studies to the KDL. The need for such an attempt becomes clear when we see the numerous references and comments on books in the KDL made by professional and amateur historians not only in their academic papers but also in their blogs and social media posts (search ‘kindai.ndl.go.jp’ on Twitter, for instance). Although this sort of user-generated content is often quite valuable in that it can provide better understandings of the historical context of books, it has been left scattered over the Web and has not been archived for later references, which has resulted in a great loss of intellectual resources. Thus KinDigi Social aims to be a platform that aggregates and accumulates such annotations, and enables interdisciplinary discussion and collaboration among scholars for the purpose of further knowledge creation.
Methods
KinDigi Social allows its users to create annotations on both image region parts of a book and on the text itself. Each annotation accepts replies from other users so that idea-exchanging and discussion can take place. It is also possible to label each annotation with Twitter-like hashtags. On the other hand, the mere capacity to create annotations is not enough to build a sustainable social medium that will keep driving engagement of users and that can endure long-term use as an archive of historical knowledge; there are difficulties concerning user interfaces, information sharing, archiving and reuse of user-generated content, among other issues. Therefore we took the following three additional measures:

Mobile first development: Since currently the KDL doesn’t offer a user interface optimized for mobile devices, its users need to keep their eyes on PC displays to read the books held in the KDL. The loss of productivity derived from the limitations of this user interface cannot and should not be ignored. Although a desktop client is also planned for KinDigi Social, we have especially focused on building a rich mobile client that enables users to access the KDL almost anywhere using their iPads or other mobile devices, and provides them the same user experience gained from the use of mobile readers such as the Amazon Kindle (see Figure 1). One large problem with displaying scanned book images on mobile devices is their size; the images from the KDL have margins surrounding the actual books. For this we developed a lightweight image analysis library that automatically detects edges in a book image and removes unnecessary margins from it (see Figure 2). The code is open-source and available on Github.
3


Support for real-time collaboration: As the recent success of social media shows, real-time communication between users is indispensable for their continual engagement. In order to support this kind of collaboration, KinDigi Social implements the following Twitter-like system: user activities, such as the creation of an annotation, will immediately prompt a notification sent to the user’s ‘followers’ through their social feeds, for real-time responses and information sharing (see Figure 3). Likewise it is possible to ‘watch’ any book in KDL just like a GitHub repository so that the user will get a notification when an annotation is created on a book. This system also contributes to forming loosely bound clusters of users. For those who do not want to make their annotations public, support for private annotations is also planned.


Data modeling following the Open Annotation specification: As a description model of user-generated annotations, KinDigi Social implements the Open Annotation Data Model, an RDF-based framework designed for modeling annotations on web content whose specification has been developed by a W3C community group (Hunter, 2010).
4 It provides controlled and machine-friendly vocabulary for modeling not only annotations but also replies from other users, folksonomy tags, and links to other resources. The adoption of the Open Annotation framework makes it possible to store and serialize user-derived content in an interoperable and reusable way for their long-term preservation and for secondary uses such as text analysis.

Figure 1. The iOS client of KinDigi Social running on an iPad.

Figure 2. An example of the automatic detection of book edges.

Figure 3. An example of social feed of user activities.

Figure 4. System architecture of KinDigi Social platform.
Implementation
As shown in Figure 4, KinDigi Social is a server-client system. The web server is written in Ruby on Rails and is hosted on a physical server in the NDL Lab where it retrieves scanned images and the bibliographic metadata of books in the KDL through the APIs provided by the NDL. It also exchanges JSON messages with its iOS and Android clients through its REST API. The mobile clients are built with HTML5 using the Apache Cordova framework,
5 so that a single source code will generate distributions for different platforms.

Annotation data generated by users is at first stored in a relational database (PostgreSQL), and is regularly converted and dumped into an RDF database and made public through its SPARQL endpoint. In this way annotation data can be easily retrieved and processed by standard semantic web technologies. We chose a relational database as a primary method of data storage purely for performance and maintenance reasons.
The user authentication is done via OAuth 2.0 protocol; users need to use their Facebook or Twitter account to log in to KinDigi Social (Authentication via OpenID is also planned). This implementation is intended to make it easy to connect with other social media platforms and to prevent vandalism by anonymous users.
Conclusion and Future Directions
Our basic standpoint is that every historical digital collection should behave like a living organism; it should continue to grow, absorbing its users’ knowledge and ideas and connecting to other historical resources. The online collaboration of scholars enhanced by real-time and mobile computing, and the standardized archive system built with Open Annotation model will contribute to the transformation of currently static digital collections into such dynamic organisms.
Although our project is still in its very early stages and we first need to evaluate how KinDigi Social can contribute to humanities research based on digital resources, a possible extension of our project would be to include support for multiple digital archives, and collections other than the KDL. A prime candidate is the digital archive administered by the Japan Center for Asian Historical Records (JACAR),
6 which holds the official documents of the Japanese Cabinet dating from the period prior to World War II. If it were possible to search the annotations created in both the KDL and JACAR seamlessly with a single query, that would greatly benefit studies of the history of modern politics in Japan, and we therefore hope to offer this functionality with our project.

Notes
1. Kindai Digital Library. http://kindai.ndl.go.jp/?__lang=en.
2. NDL Lab, http://lab.kn.ndl.go.jp/cms/.
3. Kindai-cropper (a github repository). https://github.com/yuta1984/kindai-cropper.
4. Open Annotation Data Model, http://www.openannotation.org/spec/core/.
5. Apache Cordova, http://cordova.apache.org/.
6. Japan Center for Asian Historical Records, http://www.jacar.go.jp/english/index.html.

Bibliography

Barr, R. A. and Tonra, J. (2014). Crowdsourcing Annotation and the Social Edition: Ossian Online.
DH2014.

Hunter, J., et al. (2010). The Open Annotation Collaboration: A Data Model to Support Sharing and Interoperability of Scholarly Annotations.
Digital Humanities 2010: Conference Abstracts, pp. 175–78.

Siemens, R., et al. (2012). Toward Modeling the Social Edition: An Approach to Understanding the Electronic Scholarly Edition in the Context of New and Emerging Social Media.
Literary and Linguistic Computing,
27(4): 445–61.

If this content appears in violation of your intellectual property rights, or you see errors or omissions, please reach out to Scott B. Weingart to discuss removing or amending the materials.

Conference Info

Complete

ADHO - 2015
"Global Digital Humanities"

Hosted at Western Sydney University

Sydney, Australia

June 29, 2015 - July 3, 2015

280 works by 609 authors indexed

Series: ADHO (10)

Organizers: ADHO