Context
This dataset is provided for the U.S. Patent Phrase to Phrase Matching competition. It supplements the competition data with the meaning of each code in the context column.
For more information, see the discussion thread and the starter notebook, which shows how to incorporate the data into the competition.
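As a rough illustration of what incorporating the data can look like, here is a minimal sketch that attaches a code's meaning to each competition row by looking up its context value. It is not taken from the starter notebook. The column names `code` and `title`, the new `context_text` field, the helper names, and the inline sample rows are all assumptions made for illustration, so check them against the actual files before use.

```python
import csv
import io

# Made-up stand-in for the competition data (assumed columns; real files are larger).
COMPETITION_CSV = """id,anchor,target,context
1,abatement,abatement of pollution,A47
2,abatement,act of abating,A47
3,wiring trough,cable tray,H02
"""

# Made-up stand-in for this dataset: one row per code with its meaning.
# The column names `code` and `title` are assumptions; titles are shortened.
TITLES_CSV = """code,title
A47,FURNITURE; DOMESTIC ARTICLES OR APPLIANCES
H02,GENERATION; CONVERSION OR DISTRIBUTION OF ELECTRIC POWER
"""


def load_rows(text):
    """Parse CSV text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(text)))


def add_context_text(rows, titles, missing=""):
    """Return copies of `rows` with a `context_text` field looked up by code.

    Rows whose context code has no matching title get `missing` instead.
    """
    lookup = {t["code"]: t["title"] for t in titles}
    return [
        {**row, "context_text": lookup.get(row["context"], missing)}
        for row in rows
    ]


enriched = add_context_text(load_rows(COMPETITION_CSV), load_rows(TITLES_CSV))
for row in enriched:
    print(row["context"], "->", row["context_text"])
```

With pandas, the same idea would typically be a left merge of the competition table onto the titles table, matching the context column against the code column. A left join keeps every competition row even when a code has no match.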
Content
The preprocessing script is available here: https://www.kaggle.com/code/xhlulu/download-and-process-cpc
Licensing
This data is available on the USPTO website, which provides the following copyright information:
Pursuant to federal law, most government-produced materials appearing on this website are not subject to copyright restrictions within the United States and are therefore in the public domain. Public domain information may be freely distributed and copied, but it is requested that in any subsequent use the United States Patent and Trademark Office (USPTO) be given appropriate acknowledgement (e.g., “Source: United States Patent and Trademark Office, www.uspto.gov”). The USPTO reserves the right to assert copyright protection internationally.
Acknowledgements
Photo by 2H Media on Unsplash