I need to get a list of files and then filter that set:
DECLARE @input_file string = @"\data\{*}\{*}\{*}.avro";
@filenames = SELECT filename
FROM @input_file;
@filtered = SELECT filename FROM @filenames WHERE {condition};
Something like this if it's possible...
The way to do that is to define virtual columns in your fileset. You can then extract and manipulate these virtual columns as if they were data fields extracted from your file.
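A minimal sketch of what that could look like, not a definitive script. The virtual column names (`_folder1`, `_folder2`, `_filename`), the `Body` data column, the Avro schema, the filter value, and the output path are all hypothetical. It also assumes the Avro sample extractor from the Azure U-SQL samples is registered in your database; the assembly names may differ in your account, so substitute whichever Avro extractor you actually use.

```usql
// Assumption: these sample assemblies are already registered in the current database.
REFERENCE ASSEMBLY [Newtonsoft.Json];
REFERENCE ASSEMBLY [Avro];
REFERENCE ASSEMBLY [Microsoft.Analytics.Samples.Formats];

// Each {name} in the path becomes a virtual column (names are hypothetical).
DECLARE @input_file string = @"/data/{_folder1}/{_folder2}/{_filename}.avro";

// Hypothetical schema with a single field; replace it with your real Avro schema.
DECLARE @avro_schema string = @"{""type"":""record"",""name"":""Row"",""fields"":[{""name"":""Body"",""type"":""bytes""}]}";

@rows =
    EXTRACT Body byte[],        // hypothetical data column read from the file contents
            _folder1 string,    // virtual columns filled in from the file path
            _folder2 string,
            _filename string
    FROM @input_file
    USING new Microsoft.Analytics.Samples.Formats.ApacheAvro.AvroExtractor(@avro_schema);

// Filter on the path parts like on any other column (the condition is hypothetical).
@filtered =
    SELECT DISTINCT _folder1,
                    _folder2,
                    _filename
    FROM @rows
    WHERE _folder1 == "2018";

OUTPUT @filtered
TO "/output/filtered_files.csv"
USING Outputters.Csv();
```

The `SELECT DISTINCT` collapses the result to one row per matching file, which is the closest equivalent to the file list asked for in the question.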
U-SQL also won't read files whose paths don't match the WHERE clause at all, which saves you some time. (The underscore in the virtual column name is not necessary, but it is a useful way to remember which columns came from the file and which from the path.) Hope this helps!