Author Archives: Moissinac

Using TfIdf concepts on graphs

Tf-Idf is a weighting method, often used to evaluate the importance of a word in a document.The idea here is to evaluate the importance of a predicate or a predicate-value pair for an entity of a semantic graph. Tf-Idf on … Continue reading

Posted in Non classé | Leave a comment

Artworks in Wikidata

As part of the Data&Musée project, we are interested in the related data available about artworks, artists and museums and monuments related to these works. We have already addressed this issue in the Artworks in DBpedia post. We now look … Continue reading

Posted in Data&Musée, Public data | Leave a comment

Paris Musées and Wikidata: establishing links

As of 6/1/2019, my list of establishments attached to Paris Museums includes 14 museums with 16 denominations (see at the end of this post). It was built by hand from the web site de Paris Musées.I have established several methods for … Continue reading

Posted in Public data, Semantic taging, SPARQL | Leave a comment

Find the Wikidata element corresponding to an entity we know in DBPedia

Suppose we are interested in an entity in DBPedia, for example:http://fr.dbpedia.org/resource/Mus%C3%A9e_Carnavalet (associated web page http://fr.dbpedia.org/page/Mus%C3%A9e_Carnavalet)which describes the Caranavalet Museum.We want to automatically find a possible entity in Wikidata describing the same entity.In DBPedia, an entity is always associated with the … Continue reading

Posted in Data&Musée | Leave a comment

Statistics about DBPedia-fr

I needed the number of separate entities described in dbpedia-fr. We will see a problem to consider when using linked data that uses public access points.My first attempt was to get this information with the query select count(distinct ?r) where … Continue reading

Posted in Non classé | Leave a comment

Where Telecom ParisTech publishes regularly: technical viewpoint

In the article “Where Telecom ParisTech publishes regularly“, I showed an example of use of the semantic representation of our bibliography: a graph that allows to see the lecture series mainly used by Telecom ParisTech researchers to publish scientific results … Continue reading

Posted in Non classé | Leave a comment

Where Telecom ParisTech publishes regularly

(click to magnify)Our bibliographic database does not make it easy to highlight conferences and journals where we publish often. The semantic approach that we started with the SemBib project provides answers.As part of the SemBib project, a semi-automatic work was … Continue reading

Posted in Non classé | Leave a comment

Using NLTK on Heroku with Python

On the principle of the “Extract PDF text with Python” ticket, I will create a service that uses the NLTK package. NLTK is a set of tools for building language processing programs in Python. It therefore requires using Python. Basic … Continue reading

Posted in NLP, Tutorial | Leave a comment

Services for bibliographic analysis

I present here the needs related to our approach of analysis of the production and publication of scientific documents – essentially articles – by Telecom ParisTech. It is the goal of the SemBib project. The articlesTélécom ParisTech has a bibliographical … Continue reading

Posted in NLP, SemBib | Leave a comment

Extract PDF text with Python

As part of our SemBib project to analyze the scientific production of Telecom ParisTech, I recover a lot of PDF files. To analyze the content, I need to get the raw text. In addition, as indicated in the blog Services … Continue reading

Posted in NLP, SemBib, Tutorial | Leave a comment

First contact with the tools of the Bibliographic Agency of Higher Education

As part of the SemBib project, I was led to choose a unique identifier for each author. Following my usual strategy, I started by using identifiers defined in our namespace, with our prefix. Thus, it was possible to produce results … Continue reading

Posted in Public data, Semantic taging, SemBib | 1 Comment

Unique Identifiers of Researchers versus Unicity of Identifiers of Researchers

As mentioned in the article “First contact with the tools of the ABES“, for the SemBib project, I started by using my own identifiers for the researchers. Then, I wanted to use identifiers coming from reference sources, starting with the … Continue reading

Posted in Semantic taging, SemBib | Leave a comment

Getting Started with SPARQL Access Point from the Springer Editor

As part of the SemBib project, I will discover with you the SPARQL public access point of the Springer scientific editor at http://lod.springer.com/sparql-form/index.html. For a first contact, we must get acquainted and some classic requests will help us.First, discover the … Continue reading

Posted in Public data, Semantic taging, SPARQL | Leave a comment

A country without war

I saw the question “Is the a country that has never been in a war?” qui renvoie à “World peace? These are the only 11 countries in the world that are actually free from conflict“and I thought that’s a good exercise for … Continue reading

Posted in DBpedia, Public data, Semantic taging, SPARQL, Tutorial, web | Leave a comment

CORS, semantic web and linked data

In this post, I talk about CORS and solutions to use data from a server different than the web page which use it.The development of the semantic web and linked data certainly use through the development of websites that operate … Continue reading

Posted in SPARQL, web | Tagged , | Leave a comment

Some french SPARQL endpoints

A short post, which will evolve from time to time, to list some SPARQL endpoints which have a significant relation with France, for example: data produced by a french institution; data about french resources. BNFThe Bibliothèque Nationale de France, the main … Continue reading

Posted in SPARQL | Tagged , , , | Leave a comment

Web interaction based on a simple ontology

As part of our work, we identified several ontologies on which a useful visualization mode is a tree of concepts display. In fact, in many ontologies or even set of RDF triples, a ‘view’ through the data can be a … Continue reading

Posted in OWL Cookbook, Visualization, web | Tagged , , | 1 Comment

Using public data: educational resources

Our team has undertaken reuse of public data , especially in the field of culture. For this, we treat public data available on data.gouv.fr to improve their use in semantic web and LOD ( Linked Open Data) .The first set … Continue reading

Posted in DBpedia, OWL Cookbook, Public data | Tagged , , | Leave a comment

Ontologies, properties and inheritance of features

I know that I will hurt some specialists of ontologies in speaking about inheritance of features.But let me tell a story.I’m quite enthusiast about using ontologies but also I was  recently a newbie in the domain. I do quick progress … Continue reading

Posted in OWL Cookbook | Leave a comment