Recently the Intellogist blog discussed GenomeQuest as a source for searching patent sequence data. This month GenomeQuest announced that they are adding Chinese patent sequence data to their collection. The company is adding sequences filed at the Chinese Patent Office (SIPO). The GenomeQuest press release explains that 40,000 sequences from over 5,000 Chinese patents have already been indexed into their GQ-IP product, which contains sequence information from patent collections and public sources such as GenBank, EMBL, and DDBJ.
Another well-known source of international patent sequence data is GeneSeq, a database produced by Thomson Reuters during the indexing of the Derwent World Patents Index (DWPI) file. Although it does contain sequences indexed from Chinese patents, the GenomeQuest data is likely being produced through a separate indexing process, and so should be considered a unique collection.
Discussing one sequence search product always brings on the mention of another, since to conduct a comprehensive patent sequence search, industry professionals must consult multiple unique data sources. This type of search is very challenging, so to promote community knowledge-sharing Intellogist features a Best Practices wiki on biotechnology searching, which also contains a discussion of sequence searching. The article discusses other major public and commercial sequence search sources, as well as search strategies for this type of investigation. Any user who completes a quick and free registration process on Intellogist can add to this knowledge collection.
What are your insights on sequence searching tools and techniques? Let us know in the comments, or better yet, on the Biotechnology Searching Best Practices page!
This post was contributed by Intellogist Team member Kristin Whitman. The Intellogist blog is provided for free by Intellogist’s parent company, Landon IP, a major provider of patent search, technical translation, and information services.