HowTo Geolocate NST API Reference: Difference between revisions
(Working on interpage links) |
|||
Line 365: | Line 365: | ||
* Get a list of IPv4 addresses you would like to geolocate. | * Get a list of IPv4 addresses you would like to geolocate. | ||
* Use the '''nstipgeolocate''' command or '''ipgeolocate-ajax.php''' web service to convert to XML (the ''xmlext'' format as detailed above). | * Use the '''nstipgeolocate''' command or '''ipgeolocate-ajax.php''' web service to convert to XML (the ''xmlext'' format as detailed in the "[[#ASCII_Output_API_.28nstipgeolocate.29]]" and "[[#ASCII Output As A Service]]" sections above). | ||
* Apply a XSL style sheet to convert the XML file into a KML file. | * Apply a XSL style sheet to convert the XML file into a KML file. | ||
* Open the KML file in Google Earth. | * Open the KML file in Google Earth. |
Revision as of 19:04, 4 May 2011
NST Geolocation API Overview
While the NST system provides many different and specialized geolocation renderings, it also provides a generic set of APIs to allow you to plot IPv4 addresses on a map. In general, this is accomplished as follows:
- You set up your NST system such that is enabled to geolocate IPv4 addresses.
- You produce a file (or stream) containing a list of IPv4 addresses.
- You feed your list of IPv4 addresses into one of the NST rendering engines.
- You get a rendering of your geolocated IPv4 addresses.
NST Geolocation APIs
ASCII Output API (nstipgeolocate)
The nstipgeolocate command can take a list of IPv4 addresses as input and produce a ASCII output file containing geolocated results in various formats.
The following shows the help output when nstipgeolocate is invoked with the --help option:
[root@dhcp143 ~]# nstipgeolocate --help Usage: nstipgeolocate [options] This program reads a list of IPv4 addresses from stdin or the command line and attempts to look up the associated latitude and longitude coordinate values to geolocate the IPv4 addresses passed. The results are then echoed to standard out in the format specified on the command line indicating the locations (if possible) for each IPv4 address processed. An associated resource file (Custom: "$HOME/.etc/nst/nstipgeolocate.py" or Global: "/etc/nst/nstipgeolocate.py") can be configured to provide latitude and longitude coordinate adjustments, private IPv4 Address/Network coordinate locations and the selection of the Geolocation database source. Alternatively, on can set the python environment variable: 'PYTHONPATH' to a directory containing a "nstipgeolocate.py" resource file. This program exits with the following return codes (useful if calling from a script): 0 - No issues encountered. 1 - A unknown issue, typically a Python error in the configuration file. 2 - The GeoIP implementation method was configured, but the GeoIP database did not exist. To test, check the exit status of: (nstipgeolocate </dev/null &>/dev/null). Options: --version show program's version number and exit -h, --help show this help message and exit -i 'IP0[ IP1[...]]', --ip-list='IP0[ IP1[...]]' Specify a list of IPv4 space separated addresses on command line instead of reading stdin -f plain|csv|xml|xmlext|wui|json, --format=plain|csv|xml|xmlext|wui|json The output format to generate: plain, csv (Comma Separated Values), xml (short XML format), xmlext (extended XML format), json (JavaScript Object Notation), or wui (for NST WUI page) [default: plain] -n, --disallow-host-names Only IPv4 addresses are processed, ignore host names or Fully Qualified Domain Names (FQDN) -r, --resolve-host-names Try to resolve Fully Qualified Domain Names (FQDN) for each IPv4 address [root@dhcp143 ~]#
While there are more than three different output formats available, we will demonstrate the csv, xmlext and json outputs as they are well suited for further processing by your own custom code.
Example IP File
The following IPv4 address will be saved to the file /tmp/ip-list.txt and used as the input to the example invocations of the nstipgeolocate command.
108.107.15.248 118.90.104.224 12.12.28.31
Comma Separated Values (CSV) Output
The following demonstrates how to produce a ASCII table of geolocated information for the IPv4 addresses in our file. The ASCII output will be in a comma separated value (CSV) format suitable for importing into a spreadsheet or processing with your own custom code:
[pkb@rice ~]$ nstipgeolocate -r -f csv < /tmp/ip-list.txt HostName,IpAddress,Latitude,Longitude,Alias,CountryName,CountryCode,CountryCode3,RegionName,Region,City,PostalCode,AreaCode,DmaCode,TimeZone 108-107-15-248.pools.spcsdns.net,108.107.15.248,33.9393997192,-84.2077026367,Norcross,United States,US,USA,Georgia,GA,Norcross,,404,524,America/New_York ip-118-90-104-224.xdsl.xnet.co.nz,118.90.104.224,-36.8666992188,174.766693115,Auckland,New Zealand,NZ,NZL,Auckland,E7,Auckland,,0,0,Pacific/Auckland 12.12.28.31,12.12.28.31,61.1692008972,-149.844299316,Anchorage,United States,US,USA,Alaska,AK,Anchorage,,907,743,America/Anchorage [pkb@rice ~]$
XML Output
The following demonstrates how to produce a ASCII XML file of geolocated information for a set of IPv4 addresses. This XML file is suitable for processing in a variety of methods and is what we use when building KML and KMZ output for many of our custom Google Earth renderings.
[pkb@rice ~]$ nstipgeolocate -r -f xmlext < /tmp/ip-list.txt <?xml version="1.0" encoding="UTF-8"?> <!-- NST Tools Generated: Wed May 04 07:14:30 EDT 2011 'nstipgeolocate' Host Geolocate Document --> <geoip> <properties> <comment>nstipgeolocate Host Geolocate Document</comment> <entry key="document_source">NST tools generated: nstipgeolocate</entry> <entry key="document_version">2130.000</entry> <entry key="create_time_formatted">2011-05-04 07:14:30.447865000-0400</entry> <entry key="create_time_secs">1304507670.447865</entry> </properties> <location> <hostname>108-107-15-248.pools.spcsdns.net</hostname> <addr>108.107.15.248</addr> <lat>33.939400</lat> <lon>-84.207703</lon> <properties> <entry key="city">Norcross</entry> <entry key="region_name">Georgia</entry> <entry key="alias">Norcross</entry> <entry key="region">GA</entry> <entry key="area_code">404</entry> <entry key="time_zone">America/New_York</entry> <entry key="metro_code">524</entry> <entry key="country_code3">USA</entry> <entry key="dma_code">524</entry> <entry key="country_code">US</entry> <entry key="country_name">United States</entry> <entry key="quality">6</entry> </properties> </location> <location> <hostname>ip-118-90-104-224.xdsl.xnet.co.nz</hostname> <addr>118.90.104.224</addr> <lat>-36.866699</lat> <lon>174.766693</lon> <properties> <entry key="city">Auckland</entry> <entry key="region_name">Auckland</entry> <entry key="alias">Auckland</entry> <entry key="region">E7</entry> <entry key="time_zone">Pacific/Auckland</entry> <entry key="country_code3">NZL</entry> <entry key="country_code">NZ</entry> <entry key="country_name">New Zealand</entry> <entry key="quality">6</entry> </properties> </location> <location> <hostname>12.12.28.31</hostname> <addr>12.12.28.31</addr> <lat>61.169201</lat> <lon>-149.844299</lon> <properties> <entry key="city">Anchorage</entry> <entry key="region_name">Alaska</entry> <entry key="alias">Anchorage</entry> <entry key="region">AK</entry> <entry key="area_code">907</entry> <entry key="time_zone">America/Anchorage</entry> <entry key="metro_code">743</entry> <entry key="country_code3">USA</entry> <entry key="dma_code">743</entry> <entry key="country_code">US</entry> <entry key="country_name">United States</entry> <entry key="quality">6</entry> </properties> </location> </geoip> [pkb@rice ~]$
JavaScript Object Notation Output
The following demonstrates how to produce a ASCII file in JavaScript Object Notation (JSON) format for a set of IPv4 addresses. This JSON file is suitable for processing in a variety of scripting languages or can be used directly by JavaScript code running within a web browser.
[pkb@rice ~]$ nstipgeolocate -r -f json < /tmp/ip-list.txt [["108.107.15.248", "108-107-15-248.pools.spcsdns.net", [33.939399719238281, -84.20770263671875, {"city": "Norcross", "region_name": "Georgia", "alias": "Norcross", "region": "GA", "area_code": 404, "time_zone": "America/New_York", "metro_code": 524, "country_code3": "USA", "postal_code": null, "dma_code": 524, "country_code": "US", "country_name": "United States", "quality": "6"}]], ["118.90.104.224", "ip-118-90-104-224.xdsl.xnet.co.nz", [-36.86669921875, 174.76669311523438, {"city": "Auckland", "region_name": "Auckland", "alias": "Auckland", "region": "E7", "area_code": 0, "time_zone": "Pacific/Auckland", "metro_code": 0, "country_code3": "NZL", "postal_code": null, "dma_code": 0, "country_code": "NZ", "country_name": "New Zealand", "quality": "6"}]], ["12.12.28.31", "12.12.28.31", [61.169200897216797, -149.84429931640625, {"city": "Anchorage", "region_name": "Alaska", "alias": "Anchorage", "region": "AK", "area_code": 907, "time_zone": "America/Anchorage", "metro_code": 743, "country_code3": "USA", "postal_code": null, "dma_code": 743, "country_code": "US", "country_name": "United States", "quality": "6"}]]] [pkb@rice ~]$
Unfortunately, this format is not human readable. However, if you are familiar with JSON files, you will find it quite useful. We use this output format frequently via AJAX calls from our NST WUI when presenting information on a Google Map rendering within a web browser.
Name Resolution Warning
In the above examples, we included the -r option which instructs the nstipgeolocate to try and resolve host names. Since there were only 3 IPv4 addresses to process this was not a big deal. However, if you have a lot of IPv4 addresses to process you may want to omit this option as it adds a significant amount of time to perform the lookups.
The following shows that the time to process 564 IPv4 processes jumped from 1.073 seconds to 92.536 seconds once we enabled the resolve host names option:
[pkb@rice ~]$ wc /tmp/ip-list-full.txt 564 564 7923 /tmp/ip-list-full.txt [pkb@rice ~]$ time nstipgeolocate -f csv < /tmp/ip-list-full.txt >/dev/null real 0m1.073s user 0m0.230s sys 0m0.086s [pkb@rice ~]$ time nstipgeolocate -r -f csv < /tmp/ip-list-full.txt >/dev/null real 1m32.536s user 0m0.456s sys 0m0.153s [pkb@rice ~]$
ASCII Output As A Service
While the nstipgeolocate command is quite useful if you are on the NST system itself, it can't be run if you are on a different system. You can however, use the NST WUI web service page to run the equivalent nstipgeolocate commands remotely. The following outlines how to do this.
The Base URL
The following URL is used to request that a set of IPv4 address be geolocated (change 192.168.1.143 to the address of your NST system):
https://192.168.1.143/nstwui/apps/geo/ipgeolocate-ajax.php
In order to use the above URL, you must provide a set of parameters. These parameters can either be appended directly on the end of the URL (a GET request), or submitted separately via a POST request. Examples of both methods will be described later. The GET approach is simpler, but the POST method is typically better suited when you need to geolocate a lot of IPv4 addresses.
Here are the parameters which can be included:
- op=nstipgeolocate
- This parameter is required and tells the service that you want to geolocate a set of IPv4 addresses.
- iplist=IP0[+IP1[+IP2...]]
- This parameter is required and allows you to tell the service what IPv4 address you would like to have geolocated. Each IPv4 address in the list is separated by a + symbol (which corresponds to the escaped space character).
- format=csv|xmlext|json|...
- This parameter is required and allows you to tell the service what type of output you would like back. If omitted, it defaults to xmlext (extended XML output). The common output formats are csv, xmlext, and json. For a full list of available output formats, look at the help output of the nstipgeolocate command.
- resolve=true|false
- This parameter can be used to turn on (or off) name resolution. Name resolution tends to be very slow when processing large sets of data. For example, it may only take a couple of seconds to process a thousand IPv4 addresses without name resolution and several minutes if name resolution is enabled. It is recommended you only set this option to true when processing a small set of IPv4 addresses.
A wget Example (using GET)
The following demonstrates using the NST web service to geolocate two IPv4 addresses using a simple wget invocation:
[pkb@rice ~]$ wget --quiet --no-check-certificate -O /tmp/ip-list.csv --user root --ask-password 'https://192.168.1.143/nstwui/apps/geo/ipgeolocate-ajax.php?op=nstipgeolocate&resolve=false&format=csv&iplist=108.102.15.248+118.90.104.224' Password for user “root”: [pkb@rice ~]$ wc /tmp/ip-list.csv 3 5 399 /tmp/ip-list.csv [pkb@rice ~]$ cat /tmp/ip-list.csv HostName,IpAddress,Latitude,Longitude,Alias,CountryName,CountryCode,CountryCode3,RegionName,Region,City,PostalCode,AreaCode,DmaCode,TimeZone 108.102.15.248,108.102.15.248,32.8097991943,-96.7993011475,Dallas,United States,US,USA,Texas,TX,Dallas,,214,623,America/Chicago 118.90.104.224,118.90.104.224,-36.8666992188,174.766693115,Auckland,New Zealand,NZ,NZL,Auckland,E7,Auckland,,0,0,Pacific/Auckland [pkb@rice ~]$
A wget Example (using POST)
The following takes a list of 564 IPv4 addresses in the file /tmp/ip-list.txt found above and builds a files containing the appropriate parmaters to use the POST method. The wget command is then used to POST this information to the web service and store the results as the XML file /tmp/ip-list.xml.
[pkb@rice ~]$ wc /tmp/ip-list.txt 564 564 7923 /tmp/ip-list.txt [pkb@rice ~]$ head -3 /tmp/ip-list.txt 108.107.15.248 109.154.109.210 109.230.220.96 [pkb@rice ~]$ rm -f /tmp/ip-list.post [pkb@rice ~]$ SEP="op=nstipgeolocate&format=xmlext&iplist="; [pkb@rice ~]$ for ip in $(cat /tmp/ip-list.txt); do echo -n "${SEP}${ip}" >> /tmp/ip-list.post; SEP="+"; done [pkb@rice ~]$ wget --no-check-certificate -O /tmp/ip-list.xml --user root --ask-password --post-file /tmp/ip-list.post --quiet 'https://192.168.1.143/nstwui/apps/geo/ipgeolocate-ajax.php' Password for user “root”: [pkb@rice ~]$ wc /tmp/ip-list.xml 10748 17690 377318 /tmp/ip-list.xml [pkb@rice ~]$ tail -21 /tmp/ip-list.xml <location> <hostname>99.59.87.159</hostname> <addr>99.59.87.159</addr> <lat>41.867500</lat> <lon>-87.674400</lon> <properties> <entry key="city">Chicago</entry> <entry key="region_name">Illinois</entry> <entry key="alias">Chicago</entry> <entry key="region">IL</entry> <entry key="area_code">312</entry> <entry key="time_zone">America/Chicago</entry> <entry key="metro_code">602</entry> <entry key="country_code3">USA</entry> <entry key="dma_code">602</entry> <entry key="country_code">US</entry> <entry key="country_name">United States</entry> <entry key="quality">6</entry> </properties> </location> </geoip> [pkb@rice ~]$
URL (Google Maps)
Strengths/Weaknesses
- Extremely simple to implement.
- Highly interactive.
- Requires web browser and access to a NST system configured for geolocation.
- Does not scale well. Both the length of a URL string and the number of locations you can plot on Google Maps can be a limiting factor (not a good solution if you have thousands of IPv4 addresses to plot).
- Optional host name resolution is fairly quick.
URL API Reference
This is a extremely simple API, you start with a base URL of (change the 192.168.1.143 to the IP address of your NST system):
https://192.168.1.143/nstwui/apps/geo/map.html
The above URL will simply pull up a empty Google Map. However, the web page which renders the map also accepts the following parameters (all are optional):
- NstMapIpList=IP0[+IP1[+IP2...]]
- This parameter allows you to tell the page what IPv4 address you would like to have geolocated. Each IPv4 address in the list is separated by a + symbol (which corresponds to the escaped space character).
- NstMapResolve=true|false
- This parameter can be used to turn on (or off) name resolution. If name resolution is enabled, the names will be shown when you click on the markers on the map and pull up the information balloon.
- NstMapFullScreen=true|false
- If this parameter is set to true, then the map will be opened in Full Screen mode (you will only see the map in the browser window - there will be no scroll bar).
- NstMapFitScreen=true|false
- If this parameter set to true, then the map will be opened in Fit Screen mode (you will the map and the control area in the browser window - there will be a scroll bar).
- NstMapMarkerDefault=NUMBER
- This parameter controls which marker symbol is used for the IPv4 addresses that are plotted on the map. It is essentially a index (starting from 0) corresponding to the available markers shown under the Options panel.
- NstMapMarkerMoved=NUMBER
- This parameter controls which marker symbol is used for the IPv4 addresses that have been moved (dragged by the user) the map. It is essentially a index (starting from 0) corresponding to the available markers shown under the Options panel.
- NstMapShowShadows=true|false
- This controls whether or not marker shadows are shown.
- NstMapAutoCenter=true|false
- This parameter controls whether or not the map should try to center itself on one of the markers. This is typically only useful when you know you are plotting exactly one marker.
- NstMapTypeId=hybrid|roadmap|terrain|satellite|sparse
- This parameter can be used to force the initial type of map shown to the end user.
- NstMapZoom=NUMBER
- This parameter can be used to force the initial zoom level in the range of 0 to 15 where 0 is zoomed all the way out and 15 is zoomed all the way in.
- NstMapCenterLat=LATITUDE
- This parameter can be used to force the initial latitude (in the range of -90.0 to 90.0) for the location that the map will center on.
- NstMapCenterLon=LONGITUDE
- This parameter can be used to force the initial longitude (in the range of -180.0 to 180.0) for the location that the map will center on.
NOTE: You will typically omit all but the NstMapIpList parameter and let everything else default to the user's preferences.
Example URL
The following builds a URL string which uses the NST system 192.168.1.143 to render three IPv4 addresses (108.107.15.248, 109.154.109.210, and 98.251.209.196) on a web page (we also included the NstMapResolve parameter to force name resolution lookup for the IPv4 addresses passed):
https://192.168.1.143/nstwui/apps/geo/map.html?NstMapResolve=true&NstMapIpList=108.107.15.248+109.154.109.210+98.251.209.196
If your NST system is set up for geolocation, you should be able to:
- Copy/paste the above URL into your browser.
- Change the 192.168.1.143 to the IP address of your NST system.
- Press the enter key.
- See the three IPv4 addresses plotted on a Google Map.
You should be able change the three IPv4 addresses assigned to the NstMapIpList parameter in the URL shown above to your own custom set of IPv4 addresses.
Example Commands
Typing in IPv4 addresses by hand gets old quick. So let's take a look at a real world example where we:
- Extract a list of IPv4 addresses from a log file.
- Use a few commands to build our URL.
- Open the URL in the Google Chrome browser.
For this example, we will extract some IPv4 addresses from a Apache access file. Here is a snippet of the log file we will be extracting the IPv4 addresses from:
66.92.170.40 - - [03/May/2011:09:16:53 -0600] "GET /nst/images/nstwiki_110x32.gif HTTP/1.1" 200 2443 "http://www.networksecuritytoolkit.org/nst/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/534.24 (KHTML, like Gecko) Chrome/11.0.696.60 Safari/534.24" 66.92.170.40 - - [03/May/2011:09:16:53 -0600] "GET /nst/images/nstpro_110x32.gif HTTP/1.1" 200 2375 "http://www.networksecuritytoolkit.org/nst/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/534.24 (KHTML, like Gecko) Chrome/11.0.696.60 Safari/534.24" 66.92.170.40 - - [03/May/2011:09:16:53 -0600] "GET /nstpro/images/npc_conv_ge_thumb.png HTTP/1.1" 200 58331 "http://www.networksecuritytoolkit.org/nst/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/534.24 (KHTML, like Gecko) Chrome/11.0.696.60 Safari/534.24"
This is a easy log file to extract the IPv4 addresses from. We'll simply combine the use of the awk command (to print the first word of each line) with the use of the sort command to filter out duplicates:
[pkb@rice ~]$ gzip -dc < /tmp/access_log.gz | awk -- '{ print $1; }' | sort -u >| /tmp/ip-list.txt [pkb@rice ~]$ wc /tmp/ip-list.txt 564 564 7923 /tmp/ip-list.txt [pkb@rice ~]$
The following takes a list of 564 IPv4 addresses found above and builds a very long URL. It then opens the URL using Google Chrome:
[pkb@rice ~]$ MAP_URL="https://192.168.1.143/nstwui/apps/geo/map.html?NstMapResolve=true&NstMapIpList="; [pkb@rice ~]$ SEP="${MAP_URL}"; URL=""; [pkb@rice ~]$ for ip in $(cat /tmp/ip-list.txt); do URL="${URL}${SEP}${ip}"; SEP="+"; done [pkb@rice ~]$ google-chrome "${URL}" Created new window in existing browser session. [pkb@rice ~]$
Here is what shows up in the browser from running the commands shown above:
KML Output (Google Earth)
Producing KML output involves the following steps:
- Get a list of IPv4 addresses you would like to geolocate.
- Use the nstipgeolocate command or ipgeolocate-ajax.php web service to convert to XML (the xmlext format as detailed in the "#ASCII_Output_API_.28nstipgeolocate.29" and "#ASCII Output As A Service" sections above).
- Apply a XSL style sheet to convert the XML file into a KML file.
- Open the KML file in Google Earth.
Creating a XSL style sheet to convert the XML file produced by nstgeolocate to KML is a non-trivial task. However, a working template is available at: http://nst.svn.sourceforge.net/viewvc/nst/trunk/include/xsl/geoip2kml.xsl.
Getting a list of IPv4 Addresses
How you do this is up to you. There are many locations where you can get IPv4 addresses. For this example, we'll grab them from a Apache access_log file (where Apache puts the IPv4 address as the first word of each log entry):
[pkb@rice ~]$ gzip -dc /tmp/access_log.gz | head -1 66.92.170.40 - - [03/May/2011:09:16:53 -0600] "GET /nst/images/sf-icon2.gif HTTP/1.1" 200 2934 "http://www.networksecuritytoolkit.org/nst/" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/534.24 (KHTML, like Gecko) Chrome/11.0.696.60 Safari/534.24" [pkb@rice ~]$ gzip -dc /tmp/access_log.gz | awk -- '{ print $1; }' | sort -u >| /tmp/ip-list.txt [pkb@rice ~]$ head -3 /tmp/ip-list.txt 108.107.15.248 109.154.109.210 109.230.220.96 [pkb@rice ~]$ wc /tmp/ip-list.txt 564 564 7923 /tmp/ip-list.txt [pkb@rice ~]$
We now have a file with 564 unique IPv4 addresses in it.
Convert To XML
We'll now use the nstipgeolocate command to convert the IPv4 address list into a XML file with the geolocated values associated with each IPv4 address:
[pkb@rice ~]$ nstipgeolocate --format xmlext < /tmp/ip-list.txt >| /tmp/ip-list.xml [pkb@rice ~]$
Convert To KML
We'll now use the xsltproc and the template XSL file from the NST Subversion repository to convert the XML file into a KML file for Google Earth:
[pkb@rice ~]$ wget -O /tmp/geoip2kml.xsl 'http://nst.svn.sourceforge.net/viewvc/nst/trunk/include/xsl/geoip2kml.xsl' --2011-05-04 17:27:34-- http://nst.svn.sourceforge.net/viewvc/nst/trunk/include/xsl/geoip2kml.xsl Connecting to 192.168.1.2:3128... connected. Proxy request sent, awaiting response... 200 OK Length: unspecified [text/xml] Saving to: “/tmp/geoip2kml.xsl” [ <=> ] 2,868 --.-K/s in 0.002s 2011-05-04 17:27:34 (1.57 MB/s) - “/tmp/geoip2kml.xsl” saved [2868] [pkb@rice ~]$ xsltproc /tmp/geoip2kml.xsl /tmp/ip-list.xml >| /tmp/ip-list.kml [pkb@rice ~]$
NOTE: You will likely want to customize the /tmp/geoip2kml.xsl file. It's just meant to serve as a starting point.
Open In Google Earth
You should be able to open the KML file produced using a tool like Google Earth:
[pkb@rice ~]$ google-earth /tmp/ip-list.kml [pkb@rice ~]$
Here is what the KML rendering resembles: