Difference between revisions of "Data Validation"
(Created page with "'''Introduction''' This section contains information on validating data for various bioinformatics tasks. == OSUC Database == === Invalid Locality Detection and Correction ===...") |
|||
Line 2: | Line 2: | ||
This section contains information on validating data for various bioinformatics tasks. | This section contains information on validating data for various bioinformatics tasks. | ||
− | |||
== OSUC Database == | == OSUC Database == |
Revision as of 14:56, 21 October 2011
Introduction
This section contains information on validating data for various bioinformatics tasks.
OSUC Database
Invalid Locality Detection and Correction
Using the Data_Validator package, run the listInvalidLocalities procedure to correct locality coordinates for localities within US states and list invalid coordinates for countries.
US State-level locality correction begins by gathering all of the localities within each US state that is absent of coordinates. After gathering the offending localities, the midpoint coordinate for the state is obtained from geonames and set to all of the localities within that state without coordinates.
Country-level invalid locality detection is accomplished by obtaining the bounding box region of a particular country from geonames then gathering all of the localities within that country. When a coordinate for a locality is beyond the bounding box region of a country, the locality name is displayed for manual correction.
Click here for a schedule of past and future runs of invalid locality detection and correction.