Latest revision as of 20:37, 15 May 2016

GenGIS provides the following Python plugins which can be accessed through the Data and Plugins menus. Please contact us if you have questions about using the plugins, or if you have suggestions for new plugins.

Data Retrieval

We are currently developing plugins to retrieve data from several online sources. In all cases, we ask that you familiarize yourself with the relevant Terms of Use and any Disclaimers regarding use of the data; we link to these below wherever possible. The developers of GenGIS are in no way responsible for data provided by third-party sources, and are not liable for any consequences arising from the use of our software and plugins.

GBIF Query

The GBIF Query plugin creates location and sequence data for use in GenGIS from the Global Biodiversity Information Facility (GBIF). It queries the GBIF database with one or more user-provided taxon names and a geographic range, and returns all instances with geographic location data that match the query. When using the GBIF plugin to create datasets, please read and adhere to GBIF's Data Use Agreement and Data Sharing Agreements. Our plugin makes use of the GBIF public API, which is still somewhat in flux - please let us know if you encounter any problems.

For the purposes of the next example, it is assumed that the user has loaded a Raster Map file. If a user has not loaded a Raster Map then Add Data will not be available, but the retrieved data can still be saved to disk.

Please be aware that large queries may result in the plugin entering a Not Responding state. This is controlled by the operating system, and while the plugin will not respond to user input, it is still performing its query.

Furthermore there may be cases where the count returned from Query Records may not exactly match the amount of records returned from the plugin. This is because GBIF occasionally will return results slightly outside of a specified range. To procure these samples as well it is often enough to adjust the geographic border to the next largest integer.

GBIF Query plugin.

Step 1: The Query

In order to query GBIF two things must be entered: a taxon name and a geographic range. If a map is loaded prior to running this plugin the default range borders will be the extents of the map; if not they will be the entire world. The geographic range can be fine tuned using either text input or the scroll wheels. After the appropriate information has been entered hitting Search will query GBIF for all possible taxonomic matches.

Look for taxon instances.

Step 2: Add/Remove Items

Hitting Search populates the Results Table. This is where all matches are returned by GBIF:

Unique ID Number | Full Name | Biological Classification | Data Source

Highlighting entries in this list and clicking Add or double-clicking entries adds them to the ID List. This list is what will be used to query GBIF to create the location and sequence file. Highlighting an entry in this list and clicking Remove or double-clicking an entry removes it from consideration. A user can perform multiple queries and add multiple taxa to this list, but only one geographic range can be defined.

Prepare data to be queried.

Step 3: Retrieve Data/Query Records

Once the user is satisfied with the contents of the ID List they can choose either Retrieve Data or Query Records. Query Records quickly retrieves the number of results without retrieving the results themselves; this can be used to quickly determine whether the size of the data set will be suitable for use in GenGIS. This information is displayed in the Summary dialog box. Large data sets (e.g., >1000 locations) will take more time to retrieve and process, as well as slow down GenGIS. If the user is satisfied with the amount of records they are about to retrieve they can move on to the Retrieve Data option. Here GBIF is queried and the progress of that query is displayed in the Progress box.

Output from 'Calculate'.

Step 4: Add/Export Data

Finally the user can choose to export their data to a location on their disk drive, or add it directly to GenGIS. The Export button writes three separate files to a user-specified location on disk. These files are the location file, sequence file and a source file containing collection metadata for the data set, any specialized rights associated with that data, and how to cite them for published works. Saving data in files eliminates the need to redo lengthy queries at a later date. If Add Data is selected then the location and sequence files are added directly to GenGIS without saving. The source information is imported into the description of the location layer.

Data added to GenGIS.

MG-RAST Query

The MG-RAST Query plugin creates location and sequence data for use in GenGIS from the RAST (MG-RAST) Server. It queries the MG-RAST database with a user-provided organism or function located within a geographic range and returns contents of associated studies to be used in GenGIS.

For the purposes of the next example, it is assumed that the user has loaded a Raster Map file. If a user has not loaded a Raster Map then Add Data will not be available, but the retrieved data can still be saved to disk.

Please be aware that large queries may result in the plugin entering a Not Responding state. This is controlled by the operating system, and while the plugin will not respond to user input, it is still performing its query. Also, the MG-RAST service has occasional periods where it is not available, which will generate errors when using the GenGIS plugin.