p2n-v3's Introduction

  _____      _             _     ___    _   _      _   
 |  __ \    | |           | |   |__ \  | \ | |    | |  
 | |__) |_ _| |_ ___ _ __ | |_     ) | |  \| | ___| |_ 
 |  ___/ _` | __/ _ \ '_ \| __|   / /  | . ` |/ _ \ __|
 | |  | (_| | ||  __/ | | | |_   / /_  | |\  |  __/ |_ 
 |_|   \__,_|\__\___|_| |_|\__| |____| |_| \_|\___|\__|

 This repository is deprecated. Use https://github.com/Patent2net/P2N instead.
 
About
°°°°°
Patent2Net is developed and maintained, on a voluntary basis, by a small international team of professors and researchers.
Patent2Net is a "free" package, dedicated to:
	increasing the use of patent information by academics, nano and small firms, and developing countries (all those without access to paid services)
	learning, studying and practicing how to collect, process and communicate "textual bibliographic information", and how to automate the process
	providing statistical analysis and representations of a set of patents.
	
Patent2Net is an "open source" package and contributions are welcome.
Patent2Net is provided "as is".

First step: learn how to search patent information using the Espacenet interface
°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°

http://worldwide.espacenet.com/advancedSearch?locale=en_EP

Practice, and use the available help:
http://worldwide.espacenet.com/help?locale=en_EP&method=handleHelpTopic&topic=index
In particular:
Full-text search
	http://worldwide.espacenet.com/help?locale=en_EP&method=handleHelpTopic&topic=fulltext
Boolean operators
	http://worldwide.espacenet.com/help?locale=en_EP&method=handleHelpTopic&topic=booleans
Truncation
	http://worldwide.espacenet.com/help?locale=en_EP&method=handleHelpTopic&topic=truncation
Smart search - field identifiers
	http://worldwide.espacenet.com/help?locale=en_EP&method=handleHelpTopic&topic=fieldidentifier
Limitations
	http://worldwide.espacenet.com/help?locale=en_EP&method=handleHelpTopic&topic=limitations
Date formats and Ranges
	http://worldwide.espacenet.com/help?locale=en_EP&method=handleHelpTopic&topic=dateformats
Kind Codes
	http://worldwide.espacenet.com/help?locale=en_EP&method=handleHelpTopic&topic=kindcodes
Respect the Fair use Charter for the EPO's online patent information products
	http://www.epo.org/searching/free/fair-use.html

Install Patent2Net package
°°°°°°°°°°°°°°°°°°°°°°°°°°
To run Patent2Net as a Python script, you need to install Python and some libraries:

Install Python 2.7 x86 from https://www.python.org/
Update the "path" to Python: My Computer > Properties > Advanced System Settings > Environment Variables >
Reboot
Install pip: http://www.pip-installer.org/en/latest/installing.html
Install the requests library: open a command window in "C:\Python27\Scripts" and run "pip install requests"
Install the networkx library: find the appropriate way to install it at http://networkx.github.io/documentation/latest/install.html
Install the epo-ops client by gsong from GitHub: https://github.com/55minutes/python-epo-ops-client
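
Once the libraries are installed, a quick import check can confirm that Python sees them. The following is a minimal sketch, not part of Patent2Net: the helper name missing_modules is hypothetical, and "epo_ops" is assumed to be the import name of python-epo-ops-client.

```python
import importlib

# Assumption: "epo_ops" is the import name of python-epo-ops-client.
REQUIRED_MODULES = ["requests", "networkx", "epo_ops"]


def missing_modules(names):
    """Return the names in `names` that cannot be imported."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing libraries: " + ", ".join(missing))
    else:
        print("All required libraries are importable.")
```

If a library is reported as missing, repeat the corresponding installation step above.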

Download Patent2Net from https://github.com/Patent2net/Patent2Net and unpack it wherever you want on your disk.
Use the current version, as above.

setup.py is also provided to compile binaries for your Windows operating system. Use it with the command python setup.py py2exe to produce binaries in the dist directory.
To run as an exe file (Windows): you need to install the "full package"; updating can then be done by copying only the "Patent2Net" files. Run it from a command window.

Use Patent2Net
°°°°°°°°°°°°°°

0. Edit the file OpsGatherPatentsV2.py and adapt, in line 31, the path to your EPO credentials pair: just put "key, password" on a single line of an ASCII file.
The credentials are obtained from OPS by registering and following the instructions.
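
To check that the credentials file has the expected shape before running the scripts, something like the sketch below may help. It assumes the key and password are separated by a comma on one line, as described above; the function name read_ops_credentials is hypothetical and is not the code used in OpsGatherPatentsV2.py.

```python
def read_ops_credentials(path):
    """Read 'key, password' from the first non-empty line of a text file.

    Hypothetical helper: assumes a comma separates the key from the password.
    """
    with open(path, "r") as handle:
        for line in handle:
            line = line.strip()
            if line:
                key, separator, secret = line.partition(",")
                if not separator:
                    raise ValueError("expected 'key, password' on one line")
                return key.strip(), secret.strip()
    raise ValueError("credentials file is empty")
```

Calling read_ops_credentials on your file should return a (key, password) pair; a ValueError points to a missing comma or an empty file.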

0.1 In the Patent2Net directory, create the following directories:
	PatentLists 		# will contain files of list of patents
	PatentsBiblio 		# will contain files of bibliographic data
	GephiFilesV5		# will contain gephi files, why V5? Good question my friend.
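
The three directories can also be created from Python. This is only a convenience sketch using the directory names listed above; the function name make_p2n_directories is hypothetical.

```python
import os

# Directory names taken from the list above.
P2N_DIRECTORIES = ["PatentLists", "PatentsBiblio", "GephiFilesV5"]


def make_p2n_directories(base_dir):
    """Create the working directories under base_dir; return those created."""
    created = []
    for name in P2N_DIRECTORIES:
        path = os.path.join(base_dir, name)
        if not os.path.isdir(path):
            os.makedirs(path)
            created.append(name)
    return created


if __name__ == "__main__":
    # Run from the Patent2Net directory.
    print(make_p2n_directories("."))
```

Running it a second time creates nothing and returns an empty list.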
Open a Windows command prompt in your Patent2Net directory
1 - Build the patent list and download the bibliographic data with OpsGatherPatentsV2.py
	Usage: OpsGatherPatentsV2.py Your_File "search expression"
	For example: OpsGatherPatentsV2.py test "ti =(stereolithography or \"3D print*\") AND pd<=1996"
	Escape each " inside the expression with a \. You should obtain at least as many patents as with the URL interface.
	Results are stored in the "PatentLists" directory, in a dump file in pickle format.
Note: OpsGatherPatentsV2.py gathers the patent list AND the bibliographic data ;-).
	A new directory "Abstracts" will be created in PatentsBiblio. Guess what it contains.
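
Because the dump is a pickle file, it can be inspected from Python. The sketch below is an assumption-laden helper (describe_dump is a hypothetical name): the structure of the stored object is not documented here, so it only reports the type and, where available, the length. Only unpickle files you produced yourself, since loading a pickle can execute code.

```python
import pickle


def describe_dump(path):
    """Load a pickle dump and return (type name, length or None).

    The layout of the object stored by Patent2Net is not assumed here.
    """
    with open(path, "rb") as handle:
        data = pickle.load(handle)
    size = len(data) if hasattr(data, "__len__") else None
    return type(data).__name__, size
```

Pass it the path of the dump file created in the "PatentLists" directory to see what kind of object step 1 produced.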

2 - Convert to a Gephi (gexf) file: PatentsToNet-V5.py Your_File
	The file is the one produced in step 1 (without path; the PatentsBiblio directory is the default source)
	The result is stored in the "GephiFile" directory as Your_File.gexf.
	Many more features and characteristics are available: citations, activeness, and other metrics.
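
A gexf file is plain XML, so a quick sanity check is possible without opening Gephi. The sketch below counts node and edge elements with the standard library only; the function names are hypothetical and it makes no assumption about the attributes Patent2Net writes.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tags."""
    return tag.rsplit("}", 1)[-1]


def count_gexf_elements(path):
    """Return (node_count, edge_count) for a GEXF file."""
    nodes = edges = 0
    for element in ET.parse(path).getroot().iter():
        name = local_name(element.tag)
        if name == "node":
            nodes += 1
        elif name == "edge":
            edges += 1
    return nodes, edges
```

A result of (0, 0) for Your_File.gexf suggests that the conversion found no data to convert.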

3 - Use the OPSGatherAugment-Families pair to extend the bibliographic data files (created in step 1) to the whole family of each patent.
New characteristics for DYNAMIC and HIERARCHIC networks are then produced through PatentsToNet-Families

4 - Download claims, descriptions, and full text if available using OPSGatherContentsv1 (optional, not consistent at this time): OPSGatherContentsv1.py Your_File
	The file is the one produced in step 1 (without path; the PatentLists directory is the default source)
	Available claims (mainly European Patents, EP) will be downloaded and stored in the "claims" directory


Todo List V 1.0 (Beta) 30/10/2014:
	- Future development will add analysis scenarios (one scenario, one network, e.g. authors, applicants, etc., to avoid the need for Gephi expert skills)
	- revisit node weights in networks
	- check abstract gathering (content seems to be missing)
	- complete content gathering
	- clean up unused functions and code everywhere ^_^ 




V 0.9 26/03/2014 :
°°°°°°°°°°°
	URL Links for IPC 7 and 11
	URL Links for inventors or applicant (to evaluate what they are doing out of search field, in general)
	Grouping results in Patent Family. 
	Other Gathering process list for family requests
	Subscribed version. 
	Patent Citation
	Revisit weights.
	
V 0.9 26/03/2014
°°°°°°°°°°°°°°°°
	OpsGather-PatentList
		-Accept an Espacenet "smart search" query
	PatentsToNet
		-A fully connected graph is provided for Gephi, connecting every relation (intra- and inter-field): filtering can be done in Gephi or by hacking the Python script.
		-The International Patent Classification is "truncated" at levels 1, 3, 4 and 7. Nodes are created for each level
		-Countries from patent numbers (first filing?) are treated as nodes
		-Kind codes (status) are separated as nodes
		-URL links as node attributes in gexf
			for patent number: link to Espacenet
			for International Patent Classification IPC at levels 1, 3, 4: link to IPC database (French and English)
		-Dynamic graphs are available over the first available date (columns "deb" and "fin" have to be merged as a timeline for nodes and edges in the Data Laboratory in Gephi)
		-The directed graph is built complete in bidirectional mode: Inventor-Inventor; IPC-IPC; Applicant-Applicant, and all combinations
		-Node weights are provided as....
		-Node weights over time are provided as ...
		-Edge weights are provided as ....
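
The IPC truncation mentioned above can be illustrated with a short sketch. It is not the PatentsToNet code: the function name ipc_levels is hypothetical, levels 1, 3 and 4 are read as the first 1, 3 and 4 characters of the symbol (section, class, subclass), and treating level 7 as the main group (the text before the "/") is an assumption.

```python
def ipc_levels(symbol):
    """Split an IPC symbol such as 'B29C 67/00' into hierarchy levels.

    Assumption: 'level 7' is taken to mean the main group, i.e. the
    part of the symbol before the '/'.
    """
    compact = "".join(symbol.split()).upper()
    return {
        "section": compact[:1],                  # level 1, e.g. 'B'
        "class": compact[:3],                    # level 3, e.g. 'B29'
        "subclass": compact[:4],                 # level 4, e.g. 'B29C'
        "main_group": compact.split("/", 1)[0],  # assumed level 7
    }
```

Each returned value would correspond to one node, so a single classification symbol contributes up to four nodes to the graph.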

p2n-v3's People

Contributors

alarictabaries, aziguy, e-bara, nezhacherrabi, patent2net, quoniam, romaric-g

p2n-v3's Issues

Maximum number of extracted entries

To whom it may concern,

I was trying to extract a larger amount of patents using P2N-v3 which appears to be a great package! Thank you very much for coding it by the way!
However, there seems to be a maximum number of 2,000 entries that can be extracted. How can I increase this number manually? Where do I find the particular file?

I am really looking forward to your response.

Best regards,
Philipp

Windows Installation

Hey,

I have a problem with the installation of Patent2Net-v3 on Windows.

In my case Anaconda is installed in C:\Users\Username\anaconda3\ and the P2N directory is in C:\Users\Username\P2N-v3-master. However, I have major issues adapting the installP2N.bat code to this directory structure.
Furthermore, installP2N.bat calls an "INSTALLP2N2.bat" even though the current directory only has an "installP2N.bat".

Maybe this is a common issue faced in the initial installation process.

question about the

Dear Sir/Madam,
I have installed P2N-v3 successfully according to the installation instruction. However, when I run "runp2nsingle.bat" containing the following commands:
conda activate P2N & p2n run --config=../banana_peel.cql --with-family

some error messages appear as follows:
2019-07-11 19:01:55,861 [p2n.util ] INFO : Running command "python OPSGatherPatentsv2.py ../banana_peel.cql"
Traceback (most recent call last):
  File "OPSGatherPatentsv2.py", line 71, in <module>
    configFile = LoadConfig()
  File "C:\Users\rdwang\Anaconda3\envs\P2N\lib\site-packages\patent2net-3.0.0.dev6-py3.6.egg\Patent2Net\P2N_Config.py", line 148, in LoadConfig
    return P2NConfig()
  File "C:\Users\rdwang\Anaconda3\envs\P2N\lib\site-packages\patent2net-3.0.0.dev6-py3.6.egg\Patent2Net\P2N_Config.py", line 41, in __init__
    content = self.readInputFile()
  File "C:\Users\rdwang\Anaconda3\envs\P2N\lib\site-packages\patent2net-3.0.0.dev6-py3.6.egg\Patent2Net\P2N_Config.py", line 107, in readInputFile
    return open(arg, "r").readlines()
FileNotFoundError: [Errno 2] No such file or directory: '../banana_peel.cql'
2019-07-11 19:01:56,863 [p2n.util ] ERROR : Command "python OPSGatherPatentsv2.py ../banana_peel.cql" failed with return code 1

I'd appreciate it if you could let me know what is wrong with it. Thank you.

Ray
