I have some unicode .html files that I want to display inside a THtmlViewer component, in Delphi.
I can't seem to persuade the code to work just doing '.LoadFromFile' - do I firstly need to load the unicode file into a stream and then somehow convert it?
Delphi 2007, THtmlViewer v9.45
I've not done anything with unicode files, or THtmlViewer, before.
FYI, THTMLViewer is actively maintained on google code (last commit couple minutes ago):
http://code.google.com/p/thtmlviewer/
D6-DXE2 and Lazarus compatibility, ton of fixes and improvements from the "original" (9.45) version.
Okay, well here's the guts of what I came up. Constructive criticism and observations appreciated!
// load either an ansi or unicode-type html doc into the browser component.
// the filename has already been confirmed as an existing file
procedure TfrmBrowser.LoadDocument(FFileName:string);
var
FWideText : Widestring;
FAnsiText : AnsiString;
FRequiredLen : Integer;
FFileStream : TFileStream;
FMemStream : TMemoryStream;
FBuffer : Byte;
begin
FFileStream := TFileStream.Create(FFileName, fmOpenRead or fmShareDenyNone);
// anything less than half a dozen bytes would be pointless, but...
if FFileStream.Size>1 then
begin
// checking the first byte of the file to give us a clue about file-type
FFileStream.Read(FBuffer,1);
FFileStream.Position:=0; // rewind position
if (FBuffer=$FF) or (FBuffer=$EF) then
begin
// probably Unicode
FRequiredLen := FFileStream.Size div 2; // 2 bytes per char
SetLength(FWideText, FRequiredLen);
FFileStream.Read(FWideText[1], FFileStream.Size);
// cast it into an Ansistring
FAnsiText := FWideText;
FMemStream := TMemoryStream.Create;
FMemStream.Write(FAnsiText[1], FRequiredLen);
FMemStream.Position := 0; // rewind the position
// load the stream into the THtmlViewer
vwBrowse.LoadFromStream(FMemStream);
FMemStream.Free;
end
else
begin
// probably Ansi, just load original filestream in
vwBrowse.LoadFromStream(FFileStream);
end;
FFileStream.Free;
end;
Obviously missing some error-trapping, but that's the basic idea.
You are using Delphi 2007. That's before the beginning of Unicode era in Delphi Programming!
Although it is very tiresome to make Unicode work in early versions of Delphi, it is quite possible to attain a satisfactory result in some controls, especially the THtmlView component.
I post some code example from one of my programs:
//code to toggle source or WYSIWYG views
var
htmEd: IHTMLDocument2;
begin
htmEd := HtmlEdit.Document as IHtmlDocument2;
if ToggleTabSet.TabIndex = 0 then
begin
HtmlEditContainer.PageIndex := 0; // Tab sheet index
htmEd.body.innerHTML := Memo1.Lines.Text; // TTntMemo
pnlEditorState.Caption := 'Design View';
end
else
if ToggleTabSet.TabIndex = 1 then
begin
HtmlEditContainer.PageIndex := 1;
Memo1.Lines.Text := HtmEd.body.innerHTML;
pnlEditorState.Caption := 'Source View';
end;
Reading the above code you can see that I'm using a TTntMemo component to which the html file is loaded first. I'm then loading the 'Text' of the memo to HtmlView's 'body.innerHTML' property.
htmEd.body.innerHTML := Memo1.Lines.Text;
Note:
- TntWare's 'Memo1.Lines.Text;' is WideString type.
- 'IHTMLDocument2' comes from TEmbeddedWB. See reasons for why TEmbeddedWB is good?
That is what worked for me in early days. I have switched to Delphi 2009, and things are much easier now (just setting suitable TEncoding while loading files)!