A simple method of extracting keywords from texts

poster / demo / art installation
Authorship
  1. 1. Maciej Eder

    Pedagogical University of Krakow, Institute of Polish Language - Polish Academy of Sciences

  2. 2. Michał Woźniak

    Institute of Polish Language - Polish Academy of Sciences

Work text
This plain text was ingested for the purpose of full-text search, not to preserve original formatting or readability. For the most complete copy, refer to the original conference program.

The proposal focuses on keywords extraction; its aim is two-fold. Firstly, the paper provides an evaluation of the existing techniques, namely log-likelihood keyword analysis, Zeta as developed by Burrows and refined by Craig, as well as TF-IDF weighting. Secondly, the paper introduces a brand-new method of extracting meaningful keywords, which relies on a simple observation that ordered word frequencies provide enough information about particular words’ potential keyness.

If this content appears in violation of your intellectual property rights, or you see errors or omissions, please reach out to Scott B. Weingart to discuss removing or amending the materials.

Conference Info

In review

ADHO - 2020
"carrefours / intersections"

Hosted at Carleton University, Université d'Ottawa (University of Ottawa)

Ottawa, Ontario, Canada

July 20, 2020 - July 25, 2020

475 works by 1078 authors indexed

Conference cancelled due to coronavirus. Online conference held at https://hcommons.org/groups/dh2020/. Data for this conference were initially prepared and cleaned by May Ning.

Conference website: https://dh2020.adho.org/

References: https://dh2020.adho.org/abstracts/

Series: ADHO (15)

Organizers: ADHO