Harvard:Biophysics 101/2007/Notebook:Denizkural/2007-4-30
From OpenWetWare
Jump to navigationJump to search
May 1:
Documenting my function:
input variables: - a keyword or an ordered set of keywords (one variable - a string) - the name of the FULL url of the feed, or a feed name that comes 'preloaded' (string) - a positive integer n, which will be the number of items to be returned
- The keywords will be used, if possible, to search for the relevant articles/posts/results in a given source/feed.
- The integer n is the number of news articles, clinical trials, and such to be returned... it might have an upper limit imposed by the source.
- I will leave the possibility of including new feeds open, thus the script accepts new URLS, but you could just use the pre-existing ones for clinical trials, news, etc.
Until May 3:
Work with Cynthia, Resmi, and Zack to figure out the integration of mesh-terms. Description of algorithm:
- Pick the more specific terms
- Get rid of common words (using a simple library and taking a string array set difference)
- Do a search using the terms, which will return 'condition' (Z's algorithm)
- Use the existing algorithm to get CDC data.
I would like to get together with everyone to actually test this out to see how it works.