I’m trying to find the best approach to create my sitemap (I deal with a huge number of URLs).
I have an XML Sitemap with a tree-structure, it’s composed of many sitemap.xml. Actually I have max 1000 URLs for each sitemap.xml.
What's the optimal number of urls for each sitemap.xml? Does Google prefers a larger number of URLs (greater than 1000), or less URLs distributed in more XML files?
I was experiencing the exact same issue, I dealt with 80k+ URLs in my WordPress website.
Previously I was providing 5000 URLs in each sitemap. It was resoursce hunger & Google was not crawling it suffecinotly. Then I made 2000 URLs each sitemap then crawling rate became somewhat okay. Then I decided to make 1000 URLs each sitemap & within 2 days my crawling rate became very good. within 10 days 95% of the URLs were indexed.
So, I would say go with 1000 or lower.
From the sitemaps.org protocol specification:
You can include up to 50,000 URLs in each Sitemap file:
You can link up to 50,000 Sitemap files in each Sitemap index file:
You can have multiple Sitemap index files (no limit stated):