I am working on a text summarization method ,for test my method i have a benchmark called doc 2007
,inside this benchmark i have a lot of xml file ,i should clear that file .
for example i have a xml
file like this:
<sentence id='s0'>
The nature of the proceeding
1 The principal issue in this proceeding is whether the Victorian Arts Centre falls within the category of 'premises of State Government Departments and Instrumentalities', for the purposes of provisions in industrial awards relating to rates of payment for persons employed in cleaning those premises.</sentence>
<sentence id='s1'>In turn, this depends upon whether the Victorian Arts Centre Trust, a statutory corporation established by the Victorian Arts Centre Act 1979 (Vic) ('the VAC Act'), is properly described as a State Government department or instrumentality, for the purposes of the award provisions.</sentence>
;
I should extract the string between <sentence id='s0'></sentence>
and <sentence id='s1'></sentence>
I mean the result should be like this :
The nature of the proceeding
1 The principal issue in this proceeding is whether the Victorian Arts Centre falls within the category of 'premises of State Government Departments and Instrumentalities', for the purposes of provisions in industrial awards relating to rates of payment for persons employed in cleaning those premises.
In turn, this depends upon whether the Victorian Arts Centre Trust, a statutory corporation established by the Victorian Arts Centre Act 1979 (Vic) ('the VAC Act'), is properly described as a State Government department or instrumentality, for the purposes of the award provisions.
I found some thing like this :
Regex.Match("User name (sales)", @"\(([^)]*)\)").Groups[1].Value
using Regex
,but it doesn't work .could you please give me a fast solution to do that?