ANN: NCBI EUtils client
Andrew Dalke
dalke@dalkescientific.com
Thu, 16 Jan 2003 11:14:48 -0700
We have developed a client-side interface to NCBI's "EUtils" server,
which is a web service for querying their Entrez databases, including
GenBank and PubMed.
It is available from http://www.dalkescientific.com/EUtils/
and distributed under the Biopython license,
http://www.biopython.org/License.shtml
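To give a rough idea of what a request to the EUtils server looks like, here is a minimal sketch that only builds an efetch URL and does not contact the server. The helper name efetch_url and the EUTILS_BASE constant are hypothetical and not part of the EUtils package. The base URL is an assumption: it is the address NCBI uses at the time of writing this note and may not be the one the client used originally.

```python
try:
    from urllib.parse import urlencode  # Python 3
except ImportError:
    from urllib import urlencode  # Python 2

# Assumed base address of NCBI's E-utilities; not taken from the EUtils package.
EUTILS_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/"


def efetch_url(db, ids, rettype, retmode):
    """Build (but do not fetch) an efetch request URL.

    Hypothetical helper for illustration only.
    """
    params = [
        ("db", db),              # Entrez database name, e.g. "protein"
        ("id", ",".join(ids)),   # comma-separated list of identifiers
        ("rettype", rettype),    # record format, e.g. "fasta"
        ("retmode", retmode),    # output encoding, e.g. "text"
    ]
    return EUTILS_BASE + "efetch.fcgi?" + urlencode(params)


url = efetch_url("protein", ["4579714"], "fasta", "text")
```

The client library wraps this kind of request, plus the server-side history mechanism, so that user code does not have to assemble query strings by hand.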
Example:
Get all protein sequences related to protein GI:4579714
>>> import EUtils
>>> from EUtils import HistoryClient
>>> client = HistoryClient.HistoryClient()
>>> result = client.post(EUtils.DBIds("protein", "4579714"))
>>> related = result.neighbor_links("protein")
>>> related_dbids = related.linksetdbs["protein_protein"].dbids
>>> proteins = client.post(related_dbids)
>>> len(proteins)
223
>>> infile = proteins.efetch(retmode = "text", rettype = "fasta")
>>>
>>> fasta = infile.read()
>>> print fasta[:788]
>gi|27450749|gb|AAO14677.1|AF508258_1 rhodopsin [Pyrocystis lunula]
MAPIPDGFTYGQWSLVYNSLSFGIAGMGCATIFFWLQLPNVSKSYRTALTITGLVTAIATYHYVRIFNSW
VDAFKVVNVNGGDYTVTLLGAPFNDAYRYVDWLLTVPLLLIELILVMKLPKAETVKLSWNLGVASAVMVA
LGYPGEIQDDLLVRWFWWAMAMIPFYYVVVTLVNGLSDATAKQPDSVKSLVVTARYLTVISWLTYPGVYI
IKSMGLAGNIATTYEQVGYSVADVVAKAVFGVLIWAIAAGKSDEEEKNGLLG
>gi|6319528|ref|NP_009610.1| Homolog to HSP30 heat shock protein Yro1p; Yro2p [Saccharomyces cerevisiae]
MSDYVELLKRGGNEAIKINPPTGADFHITSRGSDWLFTVFCVNLLFGVILVPLMFRKPVKDRFVYYTAIA
PNLFMSIAYFTMASNLGWIPVRAKYNHVQTSTQKEHPGYRQIFYARYVGWFLAFPWPIIQMSLLGGTPLW
QIAFNVGMTEIFTVCWLIAACVHSTYKWGYYTIGIGAAIVVCISLMTTTFNLVKARGKDVSNVFITFMSV
IMFLWLIAYPTCFGITDGGNVLQPDSATIFYGIIDLLILSILPVLFMPLANYLGIERLGLIFDEEPAEHV
GPVAEKKMPSPASFKSSDSDSSIKEKLKLKKKHKKDKKKAKKAKKAKKAKKAQEEEEDVATDSE
>>>
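The text returned by efetch in the session above is plain FASTA, so it can be split into records with a few lines of code. The following is a minimal sketch, not part of the EUtils package: parse_fasta is a hypothetical helper, and the sample string is toy data made up for illustration. It assumes each header occupies a single line starting with ">".

```python
def parse_fasta(text):
    """Split FASTA-formatted text into (header, sequence) pairs.

    Hypothetical helper for illustration; assumes one-line headers.
    """
    records = []
    header = None
    chunks = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                records.append((header, "".join(chunks)))
            header = line[1:]
            chunks = []
        elif header is not None:
            # Sequence lines are concatenated without the line breaks.
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


# Toy input, not real Entrez output.
sample = ">seq1 example protein\nMAPIPDGF\nTYGQWSLV\n>seq2 another\nMSDYVELL\n"
records = parse_fasta(sample)
```

With the session above, the same function could be applied to the string read from infile, for example parse_fasta(fasta).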
Andrew Dalke
dalke@dalkescientific.com
--
Need usable, robust software for bioinformatics or chemical
informatics? Want to integrate your different tools so you can
do more science in less time? Contact us!
http://www.dalkescientific.com/