Extending the RDF Knowledge Graph SemOpenAlex.org

Objective

This topic is about working on SemOpenAlex (https://semopenalex.org), a comprehensive RDF knowledge graph that includes over 26 billion triples related to scientific publications, authors, institutions, journals, and more. This open-access initiative offers data through RDF dump files, a SPARQL endpoint, and the Linked Open Data cloud, enhancing the visibility and accessibility of scientific research.

What are the tasks?

  • Keeping SemOpenAlex up-to-date by updating its schema according to changes in the OpenAlex database and performing periodic updates to the RDF database.
  • Expanding SemOpenAlex, e.g., by introducing author name disambiguation, integrating representations of code repositories like GitHub, and linking to other databases and knowledge graphs such as LinkedPaperWithCode.com, Wikidata, and DBLP.

What prerequisites do you need?

  • Basic understanding of RDF and enthusiasm for semantic web and open data.
  • Programming skills in Python, which are critical for various tasks including database maintenance and development.

Further Reading

https://arxiv.org/abs/2308.03671

Contact Person

Prof. Dr.-Ing. Michael Färber, michael.faerber@tu-dresden.de




Enjoy Reading This Article?

Here are some more articles you might like to read next:

  • a post with image galleries
  • Stock Market Predictions through Deep Learning
  • Designing and Executing a Large-Scale User Study on Scientific Text Simplification
  • Using Quantum Computing in Natural Language Processing
  • Advanced Multi-Modality Learning in Electronic Health Records for Personalized Medical Recommendations