Get Position in Original String from `StringTokeni

I need to get the space-separated tokens in a string, but I also need to know the character position within the original string at which each token starts. Is there any way to do this with StringTokenizer? Also, as I understand it, this is a legacy class; is there a better alternative to StringTokenizer?

Tags: java stringtokenizer

3 answers

Root（大扎）

Reply #2 · 2019-06-23 22:26

I improved micha's answer, so that it can handle neighboring spaces:

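String text = "hello  world     example";
int searchFrom = 0; // where the search for the next token begins
for (String token : text.split("[\u00A0 \n]")) {
    if (token.length() > 0) { // consecutive separators produce empty tokens; skip them
        int start = text.indexOf(token, searchFrom);
        System.out.println("token: " + token + ", start at: " + start);
        // continue after this token, so a repeated token is not found at its earlier position
        searchFrom = start + token.length();
    }
}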

Output is:

token: hello, start at: 0
token: world, start at: 7
token: example, start at: 17


Rolldiameter

Reply #3 · 2019-06-23 22:33

Prefer String#split() over StringTokenizer for splitting a string; the StringTokenizer Javadoc itself describes it as a legacy class whose use is discouraged in new code.

However, since you also want the position of each token in the string, it is better to use the Pattern and Matcher classes. Matcher#start() returns the index at which the current match begins.

Here's an example:

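import java.util.regex.Matcher;
import java.util.regex.Pattern;

String str = "abc asf basdfasf asf";
// \\S+ matches each run of non-whitespace characters
Matcher matcher = Pattern.compile("\\S+").matcher(str);

while (matcher.find()) {
    // start() is the index of the current match within str
    System.out.println(matcher.start() + ":" + matcher.group());
}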

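The pattern \\S+ matches each run of one or more non-whitespace characters in the string. Each call to Matcher#find() advances to the next match; Matcher#start() then gives its index in the original string and Matcher#group() the matched text.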
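If the positions are needed later rather than printed right away, the same loop can be wrapped in a method. The following is only a sketch: the class name TokenOffsets and the helper tokensByStart are hypothetical names chosen for illustration, and a LinkedHashMap is just one convenient way to keep the tokens in order of appearance.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class TokenOffsets {
    // Compiled once and reused; matches each run of non-whitespace characters.
    private static final Pattern TOKEN = Pattern.compile("\\S+");

    // Hypothetical helper (not from the original answer):
    // maps each token's start index to the token text, in order of appearance.
    static Map<Integer, String> tokensByStart(String text) {
        Map<Integer, String> result = new LinkedHashMap<>();
        Matcher matcher = TOKEN.matcher(text);
        while (matcher.find()) {
            result.put(matcher.start(), matcher.group());
        }
        return result;
    }

    public static void main(String[] args) {
        // prints {0=abc, 4=asf, 8=basdfasf, 17=asf}
        System.out.println(tokensByStart("abc asf basdfasf asf"));
    }
}
```

Because the start index is the key, the repeated token "asf" is kept twice, once per position. If the end of each token is also needed, Matcher#end() returns the index just past the last matched character.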


看我几分像从前

Reply #4 · 2019-06-23 22:39

You can easily do this yourself using String.split(), as long as the tokens are separated by exactly one space:

String text = "hello world example";
int tokenStartIndex = 0;
for (String token : text.split(" ")) {
    System.out.println("token: " + token + ", tokenStartIndex: " + tokenStartIndex);
    tokenStartIndex += token.length() + 1; // +1 skips the single space after the token
}

This prints:

token: hello, tokenStartIndex: 0
token: world, tokenStartIndex: 6
token: example, tokenStartIndex: 12
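The running-offset idea also works with StringTokenizer itself, which is what the question asked about. The constructor StringTokenizer(String, String, boolean) accepts a returnDelims flag; when it is true, every delimiter character is returned as its own one-character token, so adding up the lengths of everything returned gives the exact position even with consecutive spaces. A minimal sketch, assuming a space is the only delimiter; the class name TokenizerOffsets and the helper tokenStarts are hypothetical:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.StringTokenizer;

public class TokenizerOffsets {
    // Hypothetical helper (not from the original answers):
    // returns the start index of each space-separated token in text.
    static List<Integer> tokenStarts(String text) {
        List<Integer> starts = new ArrayList<>();
        // returnDelims = true: each space comes back as a separate one-character token
        StringTokenizer tokenizer = new StringTokenizer(text, " ", true);
        int offset = 0;
        while (tokenizer.hasMoreTokens()) {
            String token = tokenizer.nextToken();
            if (!token.equals(" ")) {
                starts.add(offset); // a real token begins at the current offset
            }
            offset += token.length(); // delimiters advance the offset too
        }
        return starts;
    }

    public static void main(String[] args) {
        // prints [0, 7, 13]
        System.out.println(tokenStarts("hello  world example"));
    }
}
```

Since StringTokenizer is a legacy class, this is mainly useful when existing code already depends on it.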

