TChan/Notebook/2007-4-16: Difference between revisions

From OpenWetWare
TChan (talk | contribs)
No edit summary
TChan (talk | contribs)
No edit summary
Line 1: Line 1:
==New Plan==
==New Plan==


* INPUT: string of search-ready disease name or associated gene, ex. 'BRCA1', 'Hashimoto's Thyroiditis'
* '''INPUT''': string of search-ready disease name or associated gene, ex. 'BRCA1', 'Hashimoto's Thyroiditis'
* OUTPUT: list (of lists) of 1) base site name of 2) searched-URLs for the disease/gene
* '''OUTPUT''': list (of lists) of 1) base site name of 2) searched-URLs for the disease/gene
 
 


===Sites to be Searched===
===Sites to be Searched===
Line 12: Line 10:
** Wikipedia
** Wikipedia
** (WHO)
** (WHO)


* '''Less Patient-Friendly But Possibly Useful Info''':
* '''Less Patient-Friendly But Possibly Useful Info''':
Line 18: Line 15:
** OMIM
** OMIM
** GeneCards
** GeneCards




Line 24: Line 20:
# Parse the search-term for individual sites' search URLs
# Parse the search-term for individual sites' search URLs
# Return the search-URL + parsed-search-terms
# Return the search-URL + parsed-search-terms


===Parsing===
===Parsing===
Line 32: Line 29:
** lowercase
** lowercase
* Thus, will need to check how each site handles this, in addition to fitting it within the search-URLs
* Thus, will need to check how each site handles this, in addition to fitting it within the search-URLs
* Will test each site using search-string:
* Will test each site using search-string: <code>"Hashimoto's Thyroiditis"</code>
<pre>"Hashimoto's Thyroiditis"</pre>


====eMedicine====
====eMedicine====
* Tested URL:
* Tested URL:
<pre> http://www.emedicine.com/cgi-bin/foxweb.exe/searchengine@/em/searchengine?boolean=and&book=all&maxhits=40&HiddenURL=&query=hashimoto's%20thyroiditis
<pre>http://www.emedicine.com/cgi-bin/foxweb.exe/searchengine@/em/searchengine?boolean=and&book=all&maxhits=40&HiddenURL=&query=hashimoto's%20thyroiditis</pre>
</pre>
* Case:
* Case:
** lower
** lower
* Space:
* Space:
** <pre>%20</pre>
** replaced with <code>%20</code>
* Apostrophe:
* Apostrophe:
** left in where it was
** left in where it was
* Location
# <pre>http://www.emedicine.com/cgi-bin/foxweb.exe/searchengine@/em/searchengine?boolean=and&book=all&maxhits=40&HiddenURL=&query=</pre>
# term, with replacements
====Google (General Search)====
* Google has its general search, as well as a "Treatment" search with more specific information
=====General=====
* Tested URL
<pre>http://www.google.com/search?hl=en&q=hashimoto%27s+thyroiditis&btnG=Search</pre>
* Case:
** lower
* Space:
** replaced with <code>+</code>
* Apostrophe:
** replaced with <code>%27</code>
* Location:
# <pre>http://www.google.com/search?hl=en&q=</pre>
# term, with replacements
# <pre>&btnG=Search</pre>
=====Treatment-Specific=====
* Tested URL:
<pre>http://www.google.com/search?hl=en&q=hashimoto%27s+thyroiditis+more:condition_treatment&cx=disease_for_patients&sa=N&oi=cooptsr&resnum=0&ct=col1&cd=1</pre>
* Case:
** lower
* Space:
** replaced with <code>+</code>
* Apostrophe:
** replaced with <code>%27</code>
* Location:
# <pre>http://www.google.com/search?hl=en&q=</pre>
# term, with replacements
# <pre>+more:condition_treatment&cx=disease_for_patients&sa=N&oi=cooptsr&resnum=0&ct=col1&cd=1</pre>

Revision as of 02:01, 17 April 2007

New Plan

  • INPUT: string of search-ready disease name or associated gene, ex. 'BRCA1', 'Hashimoto's Thyroiditis'
  • OUTPUT: list of lists, each holding 1) the base site name and 2) the search URL for the disease/gene on that site

Sites to be Searched

  • General Patient Info
    • eMedicine
    • Google ('more:condition_treatment' is default)
    • Wikipedia
    • (WHO)
  • Less Patient-Friendly But Possibly Useful Info:
    • HapMap
    • OMIM
    • GeneCards


Tasks

  1. Parse the search-term for individual sites' search URLs
  2. Return the search-URL + parsed-search-terms
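
The two tasks above could be wired together roughly as follows. This is only a sketch of the planned INPUT/OUTPUT shape: the names `build_search_urls` and `example_builder`, and the `example.org` URL, are placeholders invented for illustration, not part of the notebook.

```python
from typing import Callable, Dict, List


def build_search_urls(
    term: str, builders: Dict[str, Callable[[str], str]]
) -> List[List[str]]:
    """Return [[site_name, search_url], ...] for one search term."""
    return [[site, build(term)] for site, build in builders.items()]


def example_builder(term: str) -> str:
    # Placeholder rule set (hypothetical): lowercase, spaces become '+'.
    # Each real site needs its own rules, as worked out per site below.
    return "http://example.org/search?q=" + term.lower().replace(" ", "+")


results = build_search_urls("BRCA1", {"ExampleSite": example_builder})
print(results)
```

Keeping one builder function per site means a new site can be added without touching the others.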


Parsing

  • Characters in the search-term will be:
    • alpha
    • apostrophe
    • blank-space
    • lowercase
  • Thus, will need to check how each site handles this, in addition to fitting it within the search-URLs
  • Will test each site using search-string: "Hashimoto's Thyroiditis"
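
Before any site-specific replacement, the term can be brought into one common form. The sketch below assumes that "lowercase" in the list means the term is lowercased first, and it also accepts digits because the example input 'BRCA1' contains one; `normalize_term` is a hypothetical helper name.

```python
def normalize_term(term: str) -> str:
    """Lowercase a search term and collapse runs of whitespace to one space."""
    cleaned = " ".join(term.split()).lower()
    for ch in cleaned:
        # Allowed: letters, digits (assumption, for gene names), apostrophe, space.
        if not (ch.isalnum() or ch in "' "):
            raise ValueError(f"unexpected character {ch!r} in search term")
    return cleaned


print(normalize_term("Hashimoto's Thyroiditis"))
```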

eMedicine

  • Tested URL:
http://www.emedicine.com/cgi-bin/foxweb.exe/searchengine@/em/searchengine?boolean=and&book=all&maxhits=40&HiddenURL=&query=hashimoto's%20thyroiditis
  • Case:
    • lower
  • Space:
    • replaced with %20
  • Apostrophe:
    • left in where it was
  • Location:
  1. http://www.emedicine.com/cgi-bin/foxweb.exe/searchengine@/em/searchengine?boolean=and&book=all&maxhits=40&HiddenURL=&query=
  2. term, with replacements
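
A minimal sketch of the eMedicine rules noted above (lowercase, space to %20, apostrophe untouched). The function name `emedicine_url` is hypothetical; the prefix is the one recorded in the Location list.

```python
EMEDICINE_PREFIX = (
    "http://www.emedicine.com/cgi-bin/foxweb.exe/searchengine@/em/searchengine"
    "?boolean=and&book=all&maxhits=40&HiddenURL=&query="
)


def emedicine_url(term: str) -> str:
    """Build an eMedicine search URL from a plain search term."""
    # Only spaces are replaced; the apostrophe is left where it was.
    return EMEDICINE_PREFIX + term.lower().replace(" ", "%20")


print(emedicine_url("Hashimoto's Thyroiditis"))
```

For the test string this reproduces the tested URL exactly.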

Google (General Search)

  • Google has its general search, as well as a "Treatment" search with more specific information
General
  • Tested URL:
http://www.google.com/search?hl=en&q=hashimoto%27s+thyroiditis&btnG=Search
  • Case:
    • lower
  • Space:
    • replaced with +
  • Apostrophe:
    • replaced with %27
  • Location:
  1. http://www.google.com/search?hl=en&q=
  2. term, with replacements
  3. &btnG=Search
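
The Google replacements (space to +, apostrophe to %27) happen to match what Python's standard `urllib.parse.quote_plus` does, so a sketch can lean on it. The function name `google_general_url` is hypothetical; the prefix and suffix come from the Location list above.

```python
from urllib.parse import quote_plus

GOOGLE_PREFIX = "http://www.google.com/search?hl=en&q="
GOOGLE_GENERAL_SUFFIX = "&btnG=Search"


def google_general_url(term: str) -> str:
    """Build a general Google search URL from a plain search term."""
    # quote_plus turns spaces into '+' and the apostrophe into '%27'.
    return GOOGLE_PREFIX + quote_plus(term.lower()) + GOOGLE_GENERAL_SUFFIX


print(google_general_url("Hashimoto's Thyroiditis"))
```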
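Treatment-Specific
  • Tested URL:
http://www.google.com/search?hl=en&q=hashimoto%27s+thyroiditis+more:condition_treatment&cx=disease_for_patients&sa=N&oi=cooptsr&resnum=0&ct=col1&cd=1
  • Case:
    • lower
  • Space:
    • replaced with +
  • Apostrophe:
    • replaced with %27
  • Location:
  1. http://www.google.com/search?hl=en&q=
  2. term, with replacements
  3. +more:condition_treatment&cx=disease_for_patients&sa=N&oi=cooptsr&resnum=0&ct=col1&cd=1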
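
The treatment-specific URL follows the same three-part layout, with the same term replacements as the general search. In this sketch only the term is encoded, and the recorded suffix is appended verbatim so that `+more:condition_treatment` keeps its literal `+` and `:`. The function name `google_treatment_url` is hypothetical.

```python
from urllib.parse import quote_plus

GOOGLE_PREFIX = "http://www.google.com/search?hl=en&q="
GOOGLE_TREATMENT_SUFFIX = (
    "+more:condition_treatment&cx=disease_for_patients"
    "&sa=N&oi=cooptsr&resnum=0&ct=col1&cd=1"
)


def google_treatment_url(term: str) -> str:
    """Build a Google 'Treatment' search URL from a plain search term."""
    # Encode the term only; the suffix is already in its final URL form.
    return GOOGLE_PREFIX + quote_plus(term.lower()) + GOOGLE_TREATMENT_SUFFIX


print(google_treatment_url("Hashimoto's Thyroiditis"))
```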