XPathQueryAPI

Gives you the ability to query a webpage using a "SQL" like language using XPaths to parse through webpages.

Allows you to host a web API to allow you to easily query website data that might not be exposed through an API.

An example:

http://www.nasdaq.com/symbol/msft/historical

Grabbing the data in the table:

The XPATH to the first row Open column: //*[@id="quotes_content_left_pnlAJAX"]/table/tbody/tr[1]/td[2]

We can do nested xpaths on the URL in our SELECT FROM to reduce the size of our column xpath selectors:

SELECT ".//td[2]" as "Open", ".//td[3]" as "High", ".//td[4]" as "Low" FROM "http://www.nasdaq.com/symbol/msft/historical"."//*[@id=\"quotes_content_left_pnlAJAX\"]/table/tbody/tr"

First we download the website, then perform the following XPATH on the document node.

Then, for each node in the collection, we perform the xpath for each desired column.

As might be recognized, we can rename the column with the 'as' so in the JSON response, we'll see a friendlier name for the column.

A sample result from thsi would look like:

[ {"Open":85.40, "High":85.595, "Low":84.92}, {"Open":85.40, "High":85.595, "Low":84.92}, {"Open":85.40, "High":85.595, "Low":84.92}, {"Open":85.40, "High":85.595, "Low":84.92}, {"Open":85.40, "High":85.595, "Low":84.92}, {"Open":85.40, "High":85.595, "Low":84.92} ]

We can also select attributes by specifying "/@attributeName". So "/a/@href" would match the href attribute for 'Hi there' returning "hello".

Note that you must escape strings in query selectors. The language has support for TOP, WHERE, and SORT but those are not yet implemented. I'd also like to add support for constants and helper functions like STRLEN, REVERSE, SPLIT, other string functions, as well as support basic math.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
dab.xpath.antlr		dab.xpath.antlr
dab.xpath.api		dab.xpath.api
dab.xpath.query.test		dab.xpath.query.test
dab.xpath.query		dab.xpath.query
.dockerignore		.dockerignore
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
dab.xpath.api.sln		dab.xpath.api.sln

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

XPathQueryAPI

About

Releases

Packages

Languages

License

dabbers/XPathQueryAPI

Folders and files

Latest commit

History

Repository files navigation

XPathQueryAPI

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages