Patstat
This presentation is the property of its rightful owner.
Sponsored Links
1 / 14

Patstat beyond Europe PowerPoint PPT Presentation


  • 127 Views
  • Uploaded on
  • Presentation posted in: General

Patstat beyond Europe. By Gianluca Tarasconi Madrid, 9/12/2010. An insight into Patstat data from patent authorities other than EP O. What is PATSTAT.

Download Presentation

Patstat beyond Europe

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

Presentation Transcript


Patstat beyond europe

PatstatbeyondEurope

By Gianluca Tarasconi

Madrid, 9/12/2010

An insight into Patstat data from patent authorities other than EPO

1


What is patstat

What is PATSTAT

PATSTATstandsforEPO Worldwide PatentStatistical Database.Contains a snapshot of the EPO master documentation database (DOCDB) which contains data of about 90 national and international patent offices with different degree of coverage.Data include bibliographic data, citations and family links. This database is designed to be used for statistical research and requires the data to be loaded in the customer's own database.

http://www.epo.org/patents/patent-information/raw-data/test/product-14-24.html

http://forums.epo.org/epo-patstat-faqs/

2


Non epo data vs ape inv name game

Non EPO data vs APE-INV Name Game

Data from other patent authorities may help in:

Validate algorithms against other spellings/conventions;

Fill missing/correct data (FI address/city) using data from equivalents;

Use Patent Family(1) data to improve algorithms using other data to give a similarity score;

(1) For a list of patent family definitions see : C. Martinez Insight into Different Types of Patent Families, STI Working Paper 2010/2

3


Example i inpadoc family 75 mr roberts

Example (I): inpadoc family # 75, Mr Roberts

6 different spellings for name, 3 different addresses

In this case name and city are better parsed in US equivalent patent data;

4


Example ii inpadoc family 88 mr newman

Example (II): inpadoc family # 88, Mr Newman

WO patent data confirm that correct address is 43111 Robbins street

US patent tells us A. stand for Antony

5


What countries i

What countries (I)

Patstat contains 92 application authorities;

45 are inside Europe;

47 are outside Europe;

Contains regional/international authorities (WIPO; ARIPO…);

Contains also ‘terminated’ authorities (DDR, URSS)

6


What countries ii

What countries (II)

7


What dimensions are relevant

What dimensions are relevant

A) data coverage (% of coverage by year)

Are data from patent authority X 100% included into Patstat from year W to year Z ?

B) Data transmission delays

How long does it take a non EPO patent to reach in PATSTAT?

C) Completeness of geographic data

How is quality (and coverage) of address / city / country code ?

8


Data coverage i

Data coverage (I)

EPO gives partial informations

http://www.epo.org/patents/patent-information/data-quality.html

http://www.epo.org/patents/patent-information/raw-data/useful-tables.html

Total number of applications is given but not the % of total (EPO gives what it gets)

9


Data coverage ii example on india

Data coverage (II): example on India

In patstat are reported from EPO 66219 Indian applications

Indian Patent office reports 28.882 applications filed only for 2006

10


Data transimission delays i

Data Transimission delays (I)

We study time series 2003- 2008 for BR, CN, JP, DE, KR and IN compared to EP;

Graph differences suggest publication lags and data transmission lags differ from country to country;

Timeseries may also highlight ‘holes’ or changes of population (FI USPTO from 2000 onward)

11


Data transimission delays ii

Data Transimission delays (II)

12


Completeness of geographic data

Completeness of geographic data

Table for the TOP 20 by inventor count;

13 authorities have more than 80% of records with no country code;

12 authorities have 0% of address/city;

Anyway in many cases address data are inside first name field (FI: DE)

(data from patstat 09/2009)

13


Conclusions

Conclusions

Non EPO havecoverage, quality and ‘spelling’ thatmaychange a lotfrompatent authority topatent authority;

Data can beusedasaddictional source of information butnotasmain source (BONUS not MALUS);

EPO couldprobablyimprovequalityofthis data, especiallyadd more addresses (FI in april 2011 willrelease WO address data) is up tousersdemand more on thistopic.

14


  • Login