How do I count the number of words in a string?

Posted 2020-03-23 18:09

I need to count the number of words in a string. I assume the right way to do it is to count how many times the previous character is not a letter, since the string may contain colons, spaces, tabs, and other symbols. My first idea was to loop through each character and count how often it is not a letter of the alphabet:

    for(int i = 0; i < string.length(); i++) {
      for(int j = 0; i < alphabets.length(); j++) {
       if (string.charAt(i-1) == alphabets.charAt(j)) {
           counter++;
       }
     }
   }

However, this always gives me an array-out-of-bounds error. I need a little help, or another approach that is more efficient. I thought of matching only against [a-zA-Z], but I'm not sure how to compare a char with a string when counting how many times it occurs.

Thank you

Tags: java string
8 answers
我想做一个坏孩纸
#2 · 2020-03-23 18:41

Addressing the code directly, your first loop has i=0 as the first value of i, but then you ask for

string.charAt(i-1) = string.charAt(-1),

which is where your array-out-of-bounds is coming from.

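The second loop has another problem: its condition tests i instead of j, so nothing bounds j and alphabets.charAt(j) eventually runs past the end of alphabets:

for(int j = 0; i < alphabets.length(); j++) {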

You may also want to consider apostrophes as parts of words as well.
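If apostrophes should count as part of a word, one option is to match whole words with a regex rather than test single characters. The sketch below assumes a "word" is a run of ASCII letters, optionally joined by single apostrophes; that definition, the class name `ApostropheWordCount`, and the pattern are illustrative choices, not something taken from the original code.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class ApostropheWordCount {
    // Assumed definition of a word: letters, optionally joined by single
    // apostrophes, so "don't" is one word and a stray quote is ignored.
    private static final Pattern WORD = Pattern.compile("[A-Za-z]+(?:'[A-Za-z]+)*");

    static int countWords(String text) {
        Matcher matcher = WORD.matcher(text);
        int count = 0;
        // Each successful find() consumes one whole word.
        while (matcher.find()) {
            count++;
        }
        return count;
    }

    public static void main(String[] args) {
        System.out.println(countWords("Don't stop: it's one, two, three")); // prints 6
    }
}
```

Whether hyphens or digits should also belong to a word depends on the input, so the pattern would need adjusting for those cases.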

beautiful°
#3 · 2020-03-23 18:42

You are probably getting an IndexOutOfBoundsException because when i is 0, your inner loop calls string.charAt(i-1), which is charAt(-1) and throws. If you fix that, your method might work, although more efficient techniques are available.
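A minimal sketch of the loop with that index problem guarded might look like the following. It counts the positions where a letter follows a non-letter (or starts the string). Using `Character.isLetter` in place of the `alphabets` lookup is a substitution made here for brevity; it also accepts non-ASCII letters. The class name `TransitionWordCount` is hypothetical.

```java
final class TransitionWordCount {
    // Counts word starts: a letter whose previous character is not a letter.
    static int countWords(String text) {
        int counter = 0;
        for (int i = 0; i < text.length(); i++) {
            boolean isLetter = Character.isLetter(text.charAt(i));
            // Only read index i - 1 when i > 0; this guard avoids charAt(-1).
            boolean prevIsLetter = i > 0 && Character.isLetter(text.charAt(i - 1));
            if (isLetter && !prevIsLetter) {
                counter++;
            }
        }
        return counter;
    }

    public static void main(String[] args) {
        System.out.println(countWords("one, two\tthree")); // prints 3
    }
}
```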

别忘想泡老子
#4 · 2020-03-23 18:42
   if (string.charAt(i-1) == alphabets.charAt(j)) {
       counter++;
   }

You are incrementing the counter when the character is a letter of the alphabet. You should increment it when it is not.

别忘想泡老子
#5 · 2020-03-23 18:47

Your suggestion to use a regex like "[A-Za-z]" would work fine. With split, you would split on the inverse character class, like:

String[] words = "Example test: one, two, three".split("[^A-Za-z]+");
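One thing to watch when turning that array into a count: if the input starts with a non-letter, `split` returns an empty first element, and an empty input yields a single empty element. A small sketch that counts only the non-empty tokens, assuming the same ASCII-letter definition of a word (the class name `SplitWordCount` is made up for illustration):

```java
final class SplitWordCount {
    static int countWords(String text) {
        int count = 0;
        // Split on runs of non-letters, then skip the empty token that
        // appears when the text is empty or begins with a delimiter.
        for (String token : text.split("[^A-Za-z]+")) {
            if (!token.isEmpty()) {
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) {
        System.out.println(countWords("Example test: one, two, three")); // prints 5
        System.out.println(countWords(", one"));                         // prints 1
    }
}
```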

EDIT: If you're just looking for raw speed, this'll do the job more quickly.

public static int countWords(String str) {
    char[] sentence = str.toCharArray();
    boolean inWord = false;
    int wordCt = 0;
    for (char c : sentence) {
        if ((c >= 'a' && c <= 'z') || (c >= 'A' && c <= 'Z')) {
            if (!inWord) {
                wordCt++;
                inWord = true;
            }
        } else {
            inWord = false;
        }
    }
    return wordCt;
}
放我归山
#6 · 2020-03-23 18:54

Use it like this:

String s = "I am Juyel Rana, from Bangladesh";
int count = s.split(" ").length;
▲ chillily
#7 · 2020-03-23 18:55

You can use String.split() to convert the string into an array, with one word in each element. The number of words is given by the length of the array:

int words = myString.split("\\s+").length;
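Splitting on whitespace miscounts in two edge cases: leading whitespace produces an empty first element, and an empty or all-whitespace string does not give 0. A hedged sketch that trims first and handles the blank case, assuming words are simply whitespace-separated tokens (so punctuation stays attached to them); the class name `WhitespaceWordCount` is illustrative:

```java
final class WhitespaceWordCount {
    static int countWords(String text) {
        // Remove leading and trailing whitespace so split() yields no empty edge token.
        String trimmed = text.trim();
        if (trimmed.isEmpty()) {
            return 0; // a blank string contains no words
        }
        // The backslash is doubled because it sits inside a Java string literal.
        return trimmed.split("\\s+").length;
    }

    public static void main(String[] args) {
        System.out.println(countWords("  one  two\tthree ")); // prints 3
    }
}
```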