Select the language
Other research blogs of our team
-
Recent Posts
Category Archives: NLP
Using NLTK on Heroku with Python
On the principle of the “Extract PDF text with Python” ticket, I will create a service that uses the NLTK package. NLTK is a set of tools for building language processing programs in Python. It therefore requires using Python. Basic … Continue reading
Posted in NLP, Tutorial
Leave a comment
Services for bibliographic analysis
I present here the needs related to our approach of analysis of the production and publication of scientific documents – essentially articles – by Telecom ParisTech. It is the goal of the SemBib project. The articlesTélécom ParisTech has a bibliographical … Continue reading
Posted in NLP, SemBib
Leave a comment
Extract PDF text with Python
As part of our SemBib project to analyze the scientific production of Telecom ParisTech, I recover a lot of PDF files. To analyze the content, I need to get the raw text. In addition, as indicated in the blog Services … Continue reading
Posted in NLP, SemBib, Tutorial
Leave a comment