I'm trying to get the URLs for images (all MIME types) in a remote CSS file using Java.
I am using jsoup to get the URL of the css.
After countless hours of looking at CSS Parser I couldn't figure it out due to the lack of documentation.
I also looked at some other treads, but have just confused me even more:
I've also seen some examples using regex, but I am not too familiar how to implement it in java.
Would anyone have some suggestions on how to go at this problem?
In Java, you have to use a
Pattern
and aMatcher
from thejava.util.regex
package.You compile your pattern, then you instantiate your matcher with your string and then you look for everything that matches your pattern.
The CSS 2.1 spec states:
Thus you could use a regex like this one:
The
.*?
is non-greedy allowing you to take as few characters as necessary. The possessive quantifier avoids any backtrack in['"]?+
.You may also use ph-css for this. See the example "Visit all URLs contained in a CSS" located at https://github.com/phax/ph-css#code-examples. Can't do it much easier :)