I’m going to let you in on a little secret: there’s a reason why we are the market leader in patent sequence search. It has surprisingly little to do with our user-friendly search interface, our stellar customer support, or our good looks. While (at least some of) these things certainly help, it is the content that can only be found in our GQ-Pat database of patent sequences that makes the real difference. Think all patent sequence databases are the same? Let me explain what I mean in some more detail.
When, for example, a life science patent application is filed at the USPTO they ask that the inventor put all sequences into a nicely formatted list. This so-called “ST.25 listing” helps the examiners with their workflow and makes it straightforward to collect all sequences submitted to the office over time. In an ideal world, every inventor and every patent office would list sequences like this and that would be the end of it.
Unfortunately, very few patent offices in the world actually have official sequence-filing rules, and even when they do have them, they’re frequently ignored. As a result, sequences can be found anywhere in a patent: inside the text, in tables, or even as part of the figures. If your search only spans the ST.25 sequence listings, you’re going to miss out on a lot of them!
You might be wondering, “so how big is this problem, really? How many sequences can be found outside of the official ST.25 listings?”
I’m afraid that the answer is “a whole lot”. If you don’t know what you’re doing, you can easily miss more than 38% of US, WO, EP, JP, KR patent documents with sequences in them. How sure are we that this is the right number? The number is based on a massive amount of internal sequence data, that we confirmed by comparing the number of PNs with products based on ST.25 listings like US Gene, WO Gene, and PatSeq.
Sure, anyone can download ST.25 sequence listings, and index them for BLAST search. But this approach will cause you to miss out on real, critical-to-your-company, sequences. Why? Because all those sequences that are located in text, tables, and figures aren’t indexed!
Still not convinced that you’re missing documents that are relevant to you? Here’s a list with some of the largest patent assignees in our patent sequence database, and the percentage of patent applications they filed with sequences hidden in the text, tables, and figures. These are all documents that would never be found anywhere except in the GQ-Pat database.
|Patent Assignee (normalized)||Patents filed with sequences hidden in text, tables, and figures.|
|JOHNSON & JOHNSON||26.12%|
|MERCK SHARP DOME||23.50%|
|BRISTOL MYERS SQUIBB||20.07%|
How does GQ Life Sciences make sure these documents are found? Over the last five years, we have invested millions of dollars and countless hours to find every last sequence that is out there. We use proprietary algorithms to flag documents with even a minuscule chance of containing sequences. That set is then manually curated to capture all of the sequences in them. Our human curators also verify additional information like the SEQ ID NO and whether a sequence is mentioned in the claims or not. We have examined the entire backfile of over 100 million historical patents for sequences.
Of course, we haven’t stopped there – we continuously process the new patents that are being published to ensure our patent database contains the most up-to-date sequence information available. In addition to 38% more US, WO, EP, JP, KR patents, we have also indexed 153,000 documents from authorities in China, Canada, India, and Brazil that you will not easily find anywhere else.
So while it’s the content of our database that makes us the Patent Sequence Search leader, it’s our people that make sure customers get the most out of it. If you’ve never worked with us and think it’s time to give us a try, sign up for a free trial account today and you’ll get access to a team of application scientists that can show you how to get better results in less time.