Classifying Patents Based on their Semantic Content
Antonin Bergeaud,
Yoann Potiron and
Juste Raimbault
Working papers from Banque de France
Abstract:
In this paper, we extend some usual techniques of classification resulting from a large-scale data-mining and network approach. This new technology, which in particular is designed to be suitable to big data, is used to construct an open consolidated database from raw data on 4 million patents taken from the US Patent Office from 1976 onward. To build the pattern network, not only do we look at each patent title, but we also examine its full abstract and extract the relevant keywords accordingly. We refer to this classification as the semantic approach, in contrast with the more common technological approach, which consists in taking the topology obtained when considering US Patent Office technological classes. Moreover, we document that the two approaches have highly different topological measures, and we find strong statistical evidence that they feature a different model. This suggests that our method is a useful tool to extract endogenous information.
Keywords: Patents; Semantic Analysis; Network; Modularity; Innovation; USPTO
JEL-codes: O3 O39
Pages: 40
Date: 2018
New Economics Papers: this item is included in nep-big, nep-ino and nep-ipr
Citations: 7 (EconPapers)
Downloads: (external link)
https://publications.banque-france.fr/sites/defaul ... /documents/wp685.pdf (application/pdf)
Related works:
Journal Article: Classifying patents based on their semantic content (2017) 
Persistent link: https://EconPapers.repec.org/RePEc:bfr:banfra:685
Publisher: Banque de France, 31 Rue Croix des Petits Champs, LABOLOG - 49-1404, 75049 PARIS.
Bibliographic data for series maintained by Michael Brassart.