updated pattrack-server
Jonas Bosson
jonas at illuminet.se
Thu May 24 14:40:57 CEST 2007
I updated the patent publication tracker, that is, a key-value table (not
used yet) that updates publication status and numbers from, e.g.:
https://publications.european-patent-office.org/PublicationServer/resultlist.jsp?iAction=2&date=20070110
...it fetches each publication day.
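
For illustration, the per-day result list URL could be built roughly like
this. It is only a sketch, not the tracker's actual code: the function name
resultlist_url is made up here, and the iAction=2 parameter is simply copied
from the example URL above.

```python
from datetime import date
from urllib.parse import urlencode

# Base address taken from the example URL in this mail.
RESULTLIST = (
    "https://publications.european-patent-office.org"
    "/PublicationServer/resultlist.jsp"
)


def resultlist_url(day: date) -> str:
    """Return the result list URL for one publication day (hypothetical helper)."""
    # iAction=2 is copied from the example; its exact meaning is not documented here.
    query = urlencode({"iAction": 2, "date": day.strftime("%Y%m%d")})
    return f"{RESULTLIST}?{query}"


print(resultlist_url(date(2007, 1, 10)))
```

Called with 2007-01-10 this reproduces the example URL above.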
Apparently the fetcher stopped working after some changes in the
page structure at EPO in 2007-01. I am fetching the changes to fill in
the data we are missing on publications from 1980 until now.
The publication information helps us fetch relevant patents using
the EPO iDocId internal id:
https://publications.european-patent-office.org/PublicationServer/xmldocument?docId=6160450
A new fetcher for xmldocuments using this info needs to be built into gauss.
As a start I would like to dump all current patents from the db into a
file structure much like the one the USPTO patent DVD has. That is, in html
files like /EP/YYYY/XXXXXXXX.gz where XXXXXXXX is the patent number and
YYYY is XXXXXXXX/1000. It would be faster and easier to manage.
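
The layout above could be computed along these lines. This is a sketch under
assumptions: the zero padding (8 digits for the number, 4 for the directory),
the integer division, and the helper names patent_path and dump_patent are
mine, not something gauss already has.

```python
import gzip
from pathlib import Path


def patent_path(root: Path, number: int) -> Path:
    """Map a patent number to <root>/EP/YYYY/XXXXXXXX.gz (hypothetical helper)."""
    bucket = number // 1000  # YYYY = patent number / 1000, integer division assumed
    # Zero padding widths are an assumption, chosen to match the X/Y pattern.
    return root / "EP" / f"{bucket:04d}" / f"{number:08d}.gz"


def dump_patent(root: Path, number: int, html: str) -> Path:
    """Write one patent's html as a gzip file, creating directories as needed."""
    path = patent_path(root, number)
    path.parent.mkdir(parents=True, exist_ok=True)
    with gzip.open(path, "wt", encoding="utf-8") as fh:
        fh.write(html)
    return path


print(patent_path(Path("dump"), 1234567).as_posix())  # dump/EP/1234/01234567.gz
```

With this split each directory holds at most 1000 files, which keeps
directory listings manageable.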
Note that most EPO pages need a live session started before going for
the meat. So don't be surprised if the pages are not displayed at the
very first try.
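
A fetcher could deal with the session requirement roughly as follows. This is
a sketch only: it assumes the session lives in cookies, that visiting the
server's entry page is enough to start it, and that a failed first try shows
up as an error or an empty body. None of that has been checked against the
EPO server, and the names make_opener and fetch_with_session are made up.

```python
import urllib.error
import urllib.request
from http.cookiejar import CookieJar

# Assumed entry page for starting a session; not verified against the EPO server.
SERVER = "https://publications.european-patent-office.org/PublicationServer/"


def make_opener():
    """Build an opener that keeps cookies between requests."""
    return urllib.request.build_opener(
        urllib.request.HTTPCookieProcessor(CookieJar())
    )


def fetch_with_session(opener, target_url, start_url=SERVER, attempts=3):
    """Start a session on start_url, then fetch target_url, retrying on failure."""
    for _ in range(attempts):
        try:
            with opener.open(start_url) as resp:
                resp.read()  # only needed for its session cookies
            with opener.open(target_url) as resp:
                body = resp.read()
        except urllib.error.URLError:
            continue  # network or HTTP error: start over with a fresh attempt
        if body:
            return body
    raise RuntimeError(f"could not fetch {target_url} after {attempts} attempts")
```

Usage would be something like fetch_with_session(make_opener(), url), with
url being one of the xmldocument addresses above.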
best,
jonas