Using JsonSerializer.Serialize(obj);
will produce a escaped string, but I want the unescaped version. for example:
using System;
using System.Text.Json;
public class Program
{
public static void Main()
{
var a = new A{Name = "你好"};
var s = JsonSerializer.Serialize(a);
Console.WriteLine(s);
}
}
class A {
public string Name {get; set;}
}
will produce a string {"Name":"\u4F60\u597D"}
but I want {"Name":"你好"}
I created a code snippet at https://dotnetfiddle.net/w73vnO please help me.
To change the escaping behavior of the
JsonSerializer
you can pass in a customJavascriptEncoder
to theJsonSerializer
by setting theEncoder
property on theJsonSerializerOptions
.https://docs.microsoft.com/en-us/dotnet/api/system.text.json.jsonserializeroptions.encoder?view=netcore-3.0#System_Text_Json_JsonSerializerOptions_Encoder
The default behavior is designed with security in mind and the
JsonSerializer
over-escapes for defense-in-depth.If all you are looking for is escaping certain "alphanumeric" characters of a specific non-latin language, I would recommend that you instead create a
JavascriptEncoder
using theCreate
factory method rather than using theUnsafeRelaxedJsonEscaping
encoder.Doing so keeps certain safe-guards, for instance, HTML-sensitive characters will continue to be escaped.
I would caution against using
System.Text.Encodings.Web.JavaScriptEncoder.UnsafeRelaxedJsonEscaping
flippantly since it does minimal escaping (which is why it has "unsafe" in the name). If the JSON you are creating is written to a UTF-8 encoded file on disk or if its part of web request which explicitly sets the charset to utf-8 (and is not going to potentially be embedded within an HTML component as is), then it is probably OK to use this.See the remarks section within the API docs: https://docs.microsoft.com/en-us/dotnet/api/system.text.encodings.web.javascriptencoder.unsaferelaxedjsonescaping?view=netcore-3.0#remarks
You could also consider specifying
UnicodeRanges.All
if you expect/need all languages to remain un-escaped. This still escapes certain ASCII characters that are prone to security vulnerabilities.For more information and code samples, see: https://docs.microsoft.com/en-us/dotnet/standard/serialization/system-text-json-how-to?view=netcore-3.0#customize-character-encoding
See the Caution Note
You need to set the JsonSerializer options not to encode those strings.
Then you pass this options when you call your
Serialize
method.Full code:
Result:
If you need to print the result in the console, you may need to install additional language. Please refer here.