Big Data/Analytics Zone is brought to you in partnership with:

Usability is Matt Overstreet’s mission. He has worked with Federal, Fortune 500, and small businesses to help collect, mine and interact with data. When solving a problem he synthesizes experience from a liberal arts and technical background. Matt is a DZone MVB and is not an employee of DZone and has posted 3 posts at DZone. View Full User Profile

Finding Relationships in Trademark Data

07.01.2013
| 1901 views |
  • submit to reddit

At the recent National Day of Civic hacking here at OSC we dug into a few ways to find relationships between Trademarks files with the USPTO. 

If you’ve ever played with the US trademark data you’ll know that it’s both plentiful and scarce. There are lots of trademark fillings, each with the minimum possible data to make them uniquely identifiable.

That’s great for streamlined government and citizen anonymity, but no fun for finding the relationships between filings. We needed to suss out more information about the graph of trademarks.  That’s when we Eric and Wes tripped over the translations included in many of the patent filings.  We wondered if the term space for these translations might be smaller and more consistent then the space defined by the actual trademarks.  Translations were less likely to play games with spelling or grammar the way one might with the actual mark. 

Some Hacking with the data and Neo4j resulted in an intriguing dataset that we are still unpacking.  Want to play with the data? Neo4J loaded with data is at this url: http://rosetta.bloom.sh:7474/webadmin/ 

 

Published at DZone with permission of Matt Overstreet, author and DZone MVB. (source)

(Note: Opinions expressed in this article and its replies are the opinions of their respective authors and not those of DZone, Inc.)