GenomeQuest‘s database GQ-Pat has crossed the 300 million sequences milestone, the company announced in a blog post, making it the largest private or public biological sequence database on Earth.
The database now houses 256 million nucleotide sequences and more than 45 million protein sequences. The sequences are not simply automated translations of nucleotides like TrEMBL but are garnered from patents and patent applications published by patent authorities across the globe.
As of 2015, the number of nucleotide sequences in GenBank/EMBL/DDBJ consortium is 185 million, GenomeQuest pointed out.
For a more detailed report, click here.
This post was contributed by Abhishek Tiwari. The Intellogist blog and Intellogist are provided for free by Landon IP, which is a CPA Global company. Landon IP is a major provider of professional services meeting the needs of the IP community, including patent searches; analytics and technology consulting; patent, legal, and technical translations; and information research and retrieval.