I'm looking at what is needed to move from wordpress.com to a BlogEngine.NET or similar blog. I've seen a tool for replacing export.php so that it will export your wordpress site in BlogML format so it can easily be imported into BlogEngine.NET, however I'd really not want to have to setup php/wordpress just so I can import a back up from wordpress.com and then use the export from my local wordpress to have a BlogML file.
Are there any tools that will convert the wordpress file? Is there a different blog that will natively import the wordpress file?
Edit: For the question about other blog providers, I am open to them as long as they are .NET based, preferably C#.
Are there any tools that will convert the wordpress file?
wordpress.com exports in XML, so that might be easier than using feeds or otherwise grabbing html and parsing it. Navicat supports XML into PHPMyAdmin, which exports SQL.
Is there a different blog that will natively import the wordpress file?
Are you open to using platforms other than BlogEngine?
I don't know of any tool but you may be able to parse the content. Sort of a fun experiment ;)
The url for the Archives is: yourblog.wordpress.com/year/month; e.g. yourblog.wordpress.com/2009/11. The content for the page is wrapped in a div with the id="content". Each post is wrapped in a child div that contains a class called "post", and the target href is contained in a child H2 tag.
It looks roughly like this:
Potentially you could do a get for each monthly archive page and use jQuery to parse out the href to each post. Once you have all the hrefs you could run another process that would pull down each html file. You would also need a process for the images. To keep things easy you could create folder structure that mirrors the archives.