PHP regex breaking special characters

2019-09-10 15:27发布

问题:

var_dump(
  preg_replace( 
    array( 
      '#[\s\n\\n]*<[\/\s]?(br|/p|/div)[\/\s]?>[\s\n\\n]*#iu', '#\s+#' ), 
      ' ', "A 19th century Dutch walnut secrétaire à abattant") 
    )
  )
); 

The result is:

string(102) "A 19th century Dutch walnut secrétaire � abattant" 

Why does my Regex breaking special character in PHP?

回答1:

If you set your character encoding right, there is no issue. See your code run on eval.in.

Make sure to have this in your html document:

<!DOCTYPE html>
<html>
<head>
     <meta charset="utf-8">
</head>

And also save your PHP file as UTF-8 encoded. Depending on your editor this might be an option in the "Save As" dialog, or under the "preferences" or "options" menu.