updated pattrack-server

Jonas Bosson jonas at illuminet.se
Thu May 24 14:40:57 CEST 2007


I updated the patent publication tracker, that is, a key-value-table not
used yet that updates publication status and numbers from i.e:

https://publications.european-patent-office.org/PublicationServer/resultlist.jsp?iAction=2&date=20070110

...it fetches each publication day.

Apparently the fetcher had stopped working after some changes in the
page structure at EPO in 2007-01. I am fetching the changes to complete
the missing data we have on publications from 1980->now.

The publications information that helps us fetch relevant patents using
the EPO= iDocId internal id:

https://publications.european-patent-office.org/PublicationServer/xmldocument?docId=6160450

A new fetcher for xmldocumets using this info needs to built into gauss.
As a start I would like to dump all current patents from the db into a
file structure much like the USPTO-patent DVD has. That is, in html
files like /EP/YYYY/XXXXXXXX.gz where XXXXXXXXX is patent number and
YYYY is XXXXXXXX/1000. It would be faster and easier to manage.

Note that most EPO pages need a live session started before going for
the meat. So don't be surprised if the pages are not displayed at the
very first try.

best,
jonas



More information about the Gauss-parl mailing list