How can I extract whatever follows the last slash in a URL in Python? For example, these URLs should return the following:
URL: http://www.test.com/TEST1
returns: TEST1
URL: http://www.test.com/page/TEST2
returns: TEST2
URL: http://www.test.com/page/page/12345
returns: 12345
I've tried urlparse, but that gives me the full path filename, such as page/page/12345
.
partition
andrpartition
are also handy for such things:urlparse is fine to use if you want to (say, to get rid of any query string parameters).
Output:
You don't need fancy things, just see the string methods in the standard library and you can easily split your url between 'filename' part and the rest:
So you can get the part you're interested in simply with:
You cand do like this:
Where tail will be your file name.
Output:
TEST2
.