-->

Remove all but numbers from NSString

2019-01-01 02:51发布

问题:

I have an NSString (phone number) with some parenthesis and hyphens as some phone numbers are formatted. How would I remove all characters except numbers from the string?

回答1:

Old question, but how about:

  NSString *newString = [[origString componentsSeparatedByCharactersInSet:
                [[NSCharacterSet decimalDigitCharacterSet] invertedSet]] 
                componentsJoinedByString:@\"\"];

It explodes the source string on the set of non-digits, then reassembles them using an empty string separator. Not as efficient as picking through characters, but much more compact in code.



回答2:

There\'s no need to use a regular expressions library as the other answers suggest -- the class you\'re after is called NSScanner. It\'s used as follows:

NSString *originalString = @\"(123) 123123 abc\";
NSMutableString *strippedString = [NSMutableString 
        stringWithCapacity:originalString.length];

NSScanner *scanner = [NSScanner scannerWithString:originalString];
NSCharacterSet *numbers = [NSCharacterSet 
        characterSetWithCharactersInString:@\"0123456789\"];

while ([scanner isAtEnd] == NO) {
  NSString *buffer;
  if ([scanner scanCharactersFromSet:numbers intoString:&buffer]) {
    [strippedString appendString:buffer];

  } else {
    [scanner setScanLocation:([scanner scanLocation] + 1)];
  }
}

NSLog(@\"%@\", strippedString); // \"123123123\"

EDIT: I\'ve updated the code because the original was written off the top of my head and I figured it would be enough to point the people in the right direction. It seems that people are after code they can just copy-paste straight into their application.

I also agree that Michael Pelz-Sherman\'s solution is more appropriate than using NSScanner, so you might want to take a look at that.



回答3:

The accepted answer is overkill for what is being asked. This is much simpler:

NSString *pureNumbers = [[phoneNumberString componentsSeparatedByCharactersInSet:[[NSCharacterSet decimalDigitCharacterSet] invertedSet]] componentsJoinedByString:@\"\"];


回答4:

This is great, but the code does not work for me on the iPhone 3.0 SDK.

If I define strippedString as you show here, I get a BAD ACCESS error when trying to print it after the scanCharactersFromSet:intoString call.

If I do it like so:

NSMutableString *strippedString = [NSMutableString stringWithCapacity:10];

I end up with an empty string, but the code doesn\'t crash.

I had to resort to good old C instead:

for (int i=0; i<[phoneNumber length]; i++) {
    if (isdigit([phoneNumber characterAtIndex:i])) {
        [strippedString appendFormat:@\"%c\",[phoneNumber characterAtIndex:i]];
    }
}


回答5:

Though this is an old question with working answers, I missed international format support. Based on the solution of simonobo, the altered character set includes a plus sign \"+\". International phone numbers are supported by this amendment as well.

NSString *condensedPhoneNumber = [[phoneNumber componentsSeparatedByCharactersInSet:
              [[NSCharacterSet characterSetWithCharactersInString:@\"+0123456789\"]
              invertedSet]] 
              componentsJoinedByString:@\"\"];

The Swift expressions are

var phoneNumber = \" +1 (234) 567-1000 \"
var allowedCharactersSet = NSMutableCharacterSet.decimalDigitCharacterSet()
allowedCharactersSet.addCharactersInString(\"+\")
var condensedPhoneNumber = phoneNumber.componentsSeparatedByCharactersInSet(allowedCharactersSet.invertedSet).joinWithSeparator(\"\")

Which yields +12345671000 as a common international phone number format.



回答6:

Here is the Swift version of this.

import UIKit
import Foundation
var phoneNumber = \" 1 (888) 555-5551    \"
var strippedPhoneNumber = \"\".join(phoneNumber.componentsSeparatedByCharactersInSet(NSCharacterSet.decimalDigitCharacterSet().invertedSet))


回答7:

Swift version of the most popular answer:

var newString = join(\"\", oldString.componentsSeparatedByCharactersInSet(NSCharacterSet.decimalDigitCharacterSet().invertedSet))

Edit: Syntax for Swift 2

let newString = oldString.componentsSeparatedByCharactersInSet(NSCharacterSet.decimalDigitCharacterSet().invertedSet).joinWithSeparator(\"\")

Edit: Syntax for Swift 3

let newString = oldString.components(separatedBy: CharacterSet.decimalDigits.inverted).joined(separator: \"\")


回答8:

Thanks for the example. It has only one thing missing the increment of the scanLocation in case one of the characters in originalString is not found inside the numbers CharacterSet object. I have added an else {} statement to fix this.

NSString *originalString = @\"(123) 123123 abc\";
NSMutableString *strippedString = [NSMutableString 
        stringWithCapacity:originalString.length];

NSScanner *scanner = [NSScanner scannerWithString:originalString];
NSCharacterSet *numbers = [NSCharacterSet 
        characterSetWithCharactersInString:@\"0123456789\"];

while ([scanner isAtEnd] == NO) {
  NSString *buffer;
  if ([scanner scanCharactersFromSet:numbers intoString:&buffer]) {
    [strippedString appendString:buffer];
  }
  // --------- Add the following to get out of endless loop
  else {
     [scanner setScanLocation:([scanner scanLocation] + 1)];
  }    
  // --------- End of addition
}

NSLog(@\"%@\", strippedString); // \"123123123\"


回答9:

It Accept only mobile number

NSString * strippedNumber = [mobileNumber stringByReplacingOccurrencesOfString:@\"[^0-9]\" withString:@\"\" options:NSRegularExpressionSearch range:NSMakeRange(0, [mobileNumber length])];


回答10:

It might be worth noting that the accepted componentsSeparatedByCharactersInSet: and componentsJoinedByString:-based answer is not a memory-efficient solution. It allocates memory for the character set, for an array and for a new string. Even if these are only temporary allocations, processing lots of strings this way can quickly fill the memory.

A memory friendlier approach would be to operate on a mutable copy of the string in place. In a category over NSString:

-(NSString *)stringWithNonDigitsRemoved {
    static NSCharacterSet *decimalDigits;
    if (!decimalDigits) {
        decimalDigits = [NSCharacterSet decimalDigitCharacterSet];
    }
    NSMutableString *stringWithNonDigitsRemoved = [self mutableCopy];
    for (CFIndex index = 0; index < stringWithNonDigitsRemoved.length; ++index) {
        unichar c = [stringWithNonDigitsRemoved characterAtIndex: index];
        if (![decimalDigits characterIsMember: c]) {
            [stringWithNonDigitsRemoved deleteCharactersInRange: NSMakeRange(index, 1)];
            index -= 1;
        }
    }
    return [stringWithNonDigitsRemoved copy];
}

Profiling the two approaches have shown this using about 2/3 less memory.



回答11:

You can use regular expression on mutable string:

NSRegularExpression *regex = [NSRegularExpression regularExpressionWithPattern:
                                @\"[^\\\\d]\"
                                options:0
                                error:nil];

[regex replaceMatchesInString:str
                      options:0 
                        range:NSMakeRange(0, str.length) 
                 withTemplate:@\"\"];


回答12:

Built the top solution as a category to help with broader problems:

Interface:

@interface NSString (easyReplace)
- (NSString *)stringByReplacingCharactersNotInSet:(NSCharacterSet *)set 
                                             with:(NSString *)string;
@end

Implemenation:

@implementation NSString (easyReplace)
- (NSString *)stringByReplacingCharactersNotInSet:(NSCharacterSet *)set 
                                             with:(NSString *)string
{
    NSMutableString *strippedString = [NSMutableString
                                       stringWithCapacity:self.length];

    NSScanner *scanner = [NSScanner scannerWithString:self];

    while ([scanner isAtEnd] == NO) {
        NSString *buffer;
        if ([scanner scanCharactersFromSet:set intoString:&buffer]) {
            [strippedString appendString:buffer];
        } else {
            [scanner setScanLocation:([scanner scanLocation] + 1)];
            [strippedString appendString:string];
        }
    }
    return [NSString stringWithString:strippedString];
}
@end

Usage:

NSString *strippedString = 
 [originalString stringByReplacingCharactersNotInSet:
   [NSCharacterSet setWithCharactersInString:@\"01234567890\" 
                                        with:@\"\"];


回答13:

Swift 3

let notNumberCharacters = NSCharacterSet.decimalDigits.inverted
let intString = yourString.trimmingCharacters(in: notNumberCharacters)


回答14:

If you\'re just looking to grab the numbers from the string, you could certainly use regular expressions to parse them out. For doing regex in Objective-C, check out RegexKit. Edit: As @Nathan points out, using NSScanner is a much simpler way to parse all numbers from a string. I totally wasn\'t aware of that option, so props to him for suggesting it. (I don\'t even like using regex myself, so I prefer approaches that don\'t require them.)

If you want to format phone numbers for display, it\'s worth taking a look at NSNumberFormatter. I suggest you read through this related SO question and this iPhone-specific tutorial for tips on doing so. Remember that phone numbers are formatted differently depending on location and/or locale.



回答15:

Um. The first answer seems totally wrong to me. NSScanner is really meant for parsing. Unlike regex, it has you parsing the string one tiny chunk at a time. You initialize it with a string, and it maintains an index of how far along the string it\'s gotten; That index is always its reference point, and any commands you give it are relative to that point. You tell it, \"ok, give me the next chunk of characters in this set\" or \"give me the integer you find in the string\", and those start at the current index, and move forward until they find something that doesn\'t match. If the very first character already doesn\'t match, then the method returns NO, and the index doesn\'t increment.

The code in the first example is scanning \"(123)456-7890\" for decimal characters, which already fails from the very first character, so the call to scanCharactersFromSet:intoString: leaves the passed-in strippedString alone, and returns NO; The code totally ignores checking the return value, leaving the strippedString unassigned. Even if the first character were a digit, that code would fail, since it would only return the digits it finds up until the first dash or paren or whatever.

If you really wanted to use NSScanner, you could put something like that in a loop, and keep checking for a NO return value, and if you get that you can increment the scanLocation and scan again; and you also have to check isAtEnd, and yada yada yada. In short, wrong tool for the job. Michael\'s solution is better.



回答16:

For those searching for phone extraction, you can extract the phone numbers from a text using NSDataDetector, for example:

NSString *userBody = @\"This is a text with 30612312232 my phone\";
if (userBody != nil) {
    NSError *error = NULL;
    NSDataDetector *detector = [NSDataDetector dataDetectorWithTypes:NSTextCheckingTypePhoneNumber error:&error];
    NSArray *matches = [detector matchesInString:userBody options:0 range:NSMakeRange(0, [userBody length])];
    if (matches != nil) {
        for (NSTextCheckingResult *match in matches) {
            if ([match resultType] == NSTextCheckingTypePhoneNumber) {
                DbgLog(@\"Found phone number %@\", [match phoneNumber]);
            }
        }
    }
}

`



回答17:

I created a category on NSString to simplify this common operation.

NSString+AllowCharactersInSet.h

@interface NSString (AllowCharactersInSet)

- (NSString *)stringByAllowingOnlyCharactersInSet:(NSCharacterSet *)characterSet;

@end

NSString+AllowCharactersInSet.m

@implementation NSString (AllowCharactersInSet)

- (NSString *)stringByAllowingOnlyCharactersInSet:(NSCharacterSet *)characterSet {
    NSMutableString *strippedString = [NSMutableString
                                   stringWithCapacity:self.length];

    NSScanner *scanner = [NSScanner scannerWithString:self];

    while (!scanner.isAtEnd) {
        NSString *buffer = nil;

        if ([scanner scanCharactersFromSet:characterSet intoString:&buffer]) {
            [strippedString appendString:buffer];
        } else {
            scanner.scanLocation = scanner.scanLocation + 1;
        }
    }

    return strippedString;
}

@end


回答18:

I think currently best way is:

phoneNumber.replacingOccurrences(of: \"\\\\D\",
                               with: \"\",
                            options: String.CompareOptions.regularExpression)


回答19:

swift 4.1

var str = \"75003 Paris, France\"
var stringWithoutDigit = (str.components(separatedBy:CharacterSet.decimalDigits)).joined(separator: \"\")
print(stringWithoutDigit)


回答20:

Based on Jon Vogel\'s answer here it is as a Swift String extension along with some basic tests.

import Foundation
extension String {
    func stringByRemovingNonNumericCharacters() -> String {
        return self.componentsSeparatedByCharactersInSet(NSCharacterSet.decimalDigitCharacterSet().invertedSet).joinWithSeparator(\"\")
    }
}

And some tests proving at least basic functionality:

import XCTest

class StringExtensionTests: XCTestCase {

    func testStringByRemovingNonNumericCharacters() {

        let baseString = \"123\"
        var testString = baseString
        var newString = testString.stringByRemovingNonNumericCharacters()
        XCTAssertTrue(newString == testString)

        testString = \"a123b\"
        newString = testString.stringByRemovingNonNumericCharacters()
        XCTAssertTrue(newString == baseString)

        testString = \"a=1-2_3@b\"
        newString = testString.stringByRemovingNonNumericCharacters()
        XCTAssertTrue(newString == baseString)

        testString = \"(999) 999-9999\"
        newString = testString.stringByRemovingNonNumericCharacters()
        XCTAssertTrue(newString.characters.count == 10)
        XCTAssertTrue(newString == \"9999999999\")

        testString = \"abc\"
        newString = testString.stringByRemovingNonNumericCharacters()
        XCTAssertTrue(newString == \"\")
    }
}

This answers the OP\'s question but it could be easily modified to leave in phone number related characters like \",;*#+\"



回答21:

NSString *originalPhoneNumber = @\"(123) 123-456 abc\";
NSCharacterSet *numbers = [[NSCharacterSet characterSetWithCharactersInString:@\"0123456789\"] invertedSet];
NSString *trimmedPhoneNumber = [originalPhoneNumber stringByTrimmingCharactersInSet:numbers];

];

Keep it simple!