TChan/Notebook/2007-4-16: Difference between revisions
From OpenWetWare
Jump to navigationJump to search
No edit summary |
|||
Line 7: | Line 7: | ||
* '''General Patient Info''' | * '''General Patient Info''' | ||
** eMedicine | ** eMedicine | ||
** Google | ** Google | ||
** Wikipedia | ** Wikipedia | ||
** (WHO) | ** (WHO) |
Revision as of 20:05, 16 April 2007
New Plan
- INPUT: string of search-ready disease name or associated gene, ex. 'BRCA1', 'Hashimoto's Thyroiditis'
- OUTPUT: list (of lists) of 1) base site name of 2) searched-URLs for the disease/gene
Sites to be Searched
- General Patient Info
- eMedicine
- Wikipedia
- (WHO)
- Less Patient-Friendly But Possibly Useful Info:
- HapMap
- OMIM
- GeneCards
Tasks
- Parse the search-term for individual sites' search URLs
- Return the search-URL + parsed-search-terms
Code
import sys # (Temporary) search_term will get whatever the input is search_term = "Hashimoto's Thyroiditis" def parse_for_eMed(search_term): parsed_term = search_term.lower().replace(' ', '%20') return "http://www.emedicine.com/cgi-bin/foxweb.exe/searchengine@/em/searchengine?boolean=and&book=all&maxhits=40&HiddenURL=&query=%s" % parsed_term def parse_for_Google_genl(search_term): parsed_term = search_term.lower().replace("'", '%27').replace(' ', '+') return "http://www.google.com/search?hl=en&q=%s&btnG=Search" % parsed_term def parse_for_Google_treatment(search_term): parsed_term = search_term.lower().replace("'", '%27').replace(' ', '+') return "http://www.google.com/search?hl=en&q=%s+more:condition_treatment&cx=disease_for_patients&sa=N&oi=cooptsr&resnum=0&ct=col1&cd=1" % parsed_term def parse_for_Wikipedia(search_term): parsed_term = search_term.lower().capitalize().replace("'", '%27').replace(' ', '_') return "http://en.wikipedia.org/wiki/%s" % parsed_term def return_site_list_for_disease(search_term): # Currently returns site-name and URL list # ex. [["eMedicine", "http://www.emedicine.com/cgi-bin/foxweb.exe/searchengine@/em/searchengine?boolean=and&book=all&maxhits=40&HiddenURL=&query=parsed-term"]] return [["eMedicine", parse_for_eMed(search_term)], ["Google, general search", parse_for_Google_genl(search_term)], ["Google, Treatment search", parse_for_Google_treatment(search_term)], ["Wikipedia", parse_for_Wikipedia(search_term)]] final_list = return_site_list_for_disease(search_term) print final_list
Next Steps
- Accessing other, not-as-easily-accessible sites: PubMed, OMIM, HapMap, GeneCards