[protege-discussion] Hackathon at ESWC

Alexander Garcia Castro alexgarciac at gmail.com
Mon Apr 22 11:52:22 PDT 2013


Hi all, we are holding a hackathon at ESWC on May 27 in Montpellier, France.

Theme:

The ability to extract meaningful, machine-interpretable data from
scholarly publications in PDF form is a big challenge. Several open
source libraries exist that attempt to automate this process, but work
needs to be done on them to improve accuracy and reliability. Some
specific and relevant challenges include:

Ability to automatically identify and tokenize citations from the PDF
(or more accurately, from a string of text).

Ability to automatically identify those blocks of text that represent
the narrative in a PDF.

Ability to identify references within the narrative, extract their
scope, and associate them with citation information in the PDF.



Anybody interested is welcome to join us: http://scholrev.org/hackathon/

Please contact Casey McLaughlin <casey.mclaughlin at cci.fsu.edu>

--
Alexander Garcia
http://www.alexandergarcia.name/
http://www.usefilm.com/photographer/75943.html
http://www.linkedin.com/in/alexgarciac

