Assuming I have an Amazon product URL like so
http://www.amazon.com/Kindle-Wireless-Reading-Display-Generation/dp/B0015T963C/ref=amb_link_86123711_2?pf_rd_m=ATVPDKIKX0DER&pf_rd_s=center-1&pf_rd_r=0AY9N5GXRYHCADJP5P0V&pf_rd_t=101&pf_rd_p=500528151&pf_rd_i=507846
How could I scrape just the ASIN using javascript? Thanks!
@Gumbo: Your code works great!
//JS Test: Test it into firebug.
I add a php function that makes the same thing.
this is my universal amazon ASIN regexp:
The Wikipedia article on ASIN (which I've linkified in your question) gives the various forms of Amazon URLs. You can fairly easily create a regular expression (or series of them) to fetch this data using the
match()
method.A little bit of change to the regex of the first answer and it works on all the urls I have tested.
Actually, the top answer doesn't work if it's something like amazon.com/BlackBerry... (since BlackBerry is also 10 characters).
One workaround (assuming the ASIN is always capitalized, as it always is when taken from Amazon) is (in Ruby):
I've found it to work on thousands of URLs.
something like this should work (not tested)