I'm scraping a static HTML site and moving the content into a database-backed CMS. I'd like to use Textile in the CMS.
Is there a tool out there that converts HTML into Textile, so I can scrape the existing site, convert the HTML to Textile, and insert that data into the database?
Since there was no JavaScript implementation, I wrote one: https://github.com/cmroanirgo/to-textile
It's a little primitive at the moment, as it's a blind port of the 'to-markdown' equivalent, but should get the job done.
I know this is an old question, but I found myself trying to do this the other day and not finding anything useful, until I found Pandoc. It can convert loads of other markup formats as well - it's quite brilliant.
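As a rough sketch of how Pandoc could be slotted into a scraping script, the snippet below shells out to the `pandoc` command with its `--from html --to textile` options. The helper names `pandoc_command` and `convert_with_pandoc` are made up for illustration, and it assumes the `pandoc` executable is installed and on your PATH.

```python
import shutil
import subprocess


def pandoc_command(source_format="html", target_format="textile"):
    """Build the argument list for a pandoc conversion (hypothetical helper)."""
    return ["pandoc", "--from", source_format, "--to", target_format]


def convert_with_pandoc(markup):
    """Pipe an HTML string through pandoc and return the Textile it prints."""
    # Assumption: pandoc is installed separately; it is not a Python package here.
    if shutil.which("pandoc") is None:
        raise RuntimeError("pandoc executable not found on PATH")
    result = subprocess.run(
        pandoc_command(),
        input=markup,          # HTML goes in on stdin
        capture_output=True,   # Textile comes back on stdout
        text=True,
        check=True,            # raise if pandoc exits with an error
    )
    return result.stdout
```

Calling `convert_with_pandoc(scraped_html)` for each scraped page would give you a string you can insert into the database directly.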
Try this simple Java code; I hope it works for you.
Here is a C# library that converts HTML to Textile. Note that it produces Textile with their additions, not pure Textile.
This is a simple markup replacement, nothing a good regex could not fix.
I recommend Perl, LWP::Simple and some regexes to do the whole thing (spidering, stripping design and menus, converting to textile, and then posting to the database.)
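To give an idea of what the regex approach might look like, here is a minimal sketch. It is written in Python rather than the Perl recommended above, purely for illustration, and it only covers the conversion step (not spidering or posting to the database). The `RULES` table and the `html_to_textile` name are my own assumptions, it handles just a handful of simple, well-formed tags, and it flattens ordered lists into bullets. Real-world pages with nested or malformed markup would likely need a proper parser.

```python
import html
import re

# Ordered (pattern, replacement) pairs. Illustrative subset only.
RULES = [
    (r"<h([1-6])(?:\s[^>]*)?>(.*?)</h\1>", r"\n\nh\1. \2\n\n"),  # headings
    (r"<(?:strong|b)>(.*?)</(?:strong|b)>", r"*\1*"),             # bold
    (r"<(?:em|i)>(.*?)</(?:em|i)>", r"_\1_"),                     # italic
    (r'<a\s[^>]*href="([^"]*)"[^>]*>(.*?)</a>', r'"\2":\1'),      # links
    (r"<li(?:\s[^>]*)?>(.*?)</li>", r"* \1\n"),                   # list items
    (r"</?(?:ul|ol)(?:\s[^>]*)?>", "\n"),                         # list wrappers
    (r"<br\s*/?>", "\n"),                                         # line breaks
    (r"<p(?:\s[^>]*)?>(.*?)</p>", r"\n\n\1\n\n"),                 # paragraphs
]


def html_to_textile(markup):
    """Convert a small subset of HTML to Textile using plain regex rules."""
    text = markup
    for pattern, replacement in RULES:
        text = re.sub(pattern, replacement, text, flags=re.IGNORECASE | re.DOTALL)
    text = re.sub(r"<[^>]+>", "", text)      # drop any tags the rules missed
    text = html.unescape(text)               # &amp; -> &, and so on
    text = re.sub(r"[ \t]+\n", "\n", text)   # trim trailing spaces on lines
    text = re.sub(r"\n{3,}", "\n\n", text)   # collapse runs of blank lines
    return text.strip()


sample = (
    '<h1>Title</h1><p>Hello <strong>world</strong>, see '
    '<a href="http://example.com">the site</a>.</p>'
    "<ul><li>one</li><li>two</li></ul>"
)
print(html_to_textile(sample))
```

The order of the rules matters: inline tags are rewritten before the block-level ones so that the paragraph and list rules see already-converted text.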