如何用正则表达式去掉html标签

2024-11-23 00:41:24

推荐回答（4个）

回答1：

　　用正则表达式去掉html标签，下面是它的代码，直接复制就可以用的。
　　代码：
　　public
　　static string StripHTML(string HTML) //google "StripHTML" 得到 {
　　string[] Regexs = {
　　@"]*?>.*?",
　　@"<(\/\s*)?!?((\w+:)?\w+)(\w+(\s*=?\s*(([""'])(\\[""'tbnr]|[^\7])*?\7|\w+)|.{0})|\s)*?(\/\s*)?>",
　　@"([\r\n])[\s]+", @"&(quot|#34);",
　　@"&(amp|#38);", @"&(lt|#60);",
　　@"&(gt|#62);", @"&(nbsp|#160);",
　　@"&(iexcl|#161);",
　　@"&(cent|#162);",
　　@"&(pound|#163);",
　　@"&(copy|#169);", @"&#(\d+);",
　　@"-->", @" Other text';

2、定义正则表达式并替换
$val = preg_replace('/<[^<]+?>/g', ' ', $row_get_Business['business_description']);

3、截取指定长度
$businessDesc = substr(val,0,110);

回答3：

string regexstr = @"<[^>]*>";
, context = Regex.Replace(contextregexstr, string.Empty, RegexOptions.IgnoreCase);
context = context.Replace(" ", "");

回答4：

http://zhidao.baidu.com/question/319589344.html?oldq=1 http://zhidao.baidu.com/question/319650743.html?oldq=1 楼主参考我回考别人的两个问题吧...