Simplest way to extract first Unicode codepoint of

For historical reasons, Cocoa's Unicode implementation is 16-bit: it handles Unicode characters above 0xFFFF via "surrogate pairs". This means that the following code is not going to work:

NSString myString = @"


               
                
                   
                        
                        标签：
                            
                              
                                                                      cocoa
                           
               
                  nsstring
                           
               
                  surrogate-pairs
                           
               
                                
                           
                        
                    
                    
                                                   
                        
                                          
                        
                        
                        
                        
                        举报


   
    



        
        
        
        
        1条回答

           
       
           
           
           
                                              
            
                                  
            
            
            
            
            
            手持菜刀，她持情操                          
            
             
             2楼-- · 2020-04-08 12:20
             
             
             
                          
             
                                                                          
A single Unicode code point might be a Surrogate Pair, but also not all language characters are single code points. i.e. not all language characters are represented by one or two UTF-16 units. Many characters are represented by a sequence of Unicode code points. 

This means that unless you are dealing with Ascii you have to think of language characters as substrings, not unicode code points at indexes. 

To get the substring for the character at index 0:

NSRange r = [[myString rangeOfComposedCharacterSequenceAtIndex:0];
[myString substringWithRange:r];


This may or may not be what you want depending on what you are actually hoping to do. e.g. although this will give you 'character boundaries' these won't correspond to cursor insertion points, which are language specific.
    
                                                                    
                                                        
            
              
                查看更多
                
             
              0人赞

                                                     添加讨论(0)

                                                                                                            
                               举报
                
                
                
                  
                


                        
                            

                               
             
                        
               
            

                            
                            
                                 加载中...
                            
                        

                
   
   
               
               
     
                      登录 后发表回答



   
   
   
  
   相关问题
      
    
    
   
   

     


   
   NSOutlineView drag line stuck + blue border   

   



     


   
   iphone sdk see size of local file (one created   

   



     


   
   iOS SecKeyRef from NSString   

   



     


   
   How can you detect the connection and disconnectio   

   



     


   
   QuickLook Plugin Failing with sandboxing error   

   



        
      
    查看全部
   
   
  
   相关文章
 
   
   

     


   
   Converting (u)int64_t to NSNumbers   

     


   
   “getter” keyword in @property declaration in Objec   

     


   
   NSMenuItem KeyEquivalent “ ”(space) bug   

     


   
   In Objective-C, how to print out N spaces? (using    

     


   
   Detect if cursor is hidden on Mac OS X   

     


   
   NSNumberFormatter doesn't allow typing decimal   

     


   
   Is subclassing NSNotification the right route if I   

     


   
   Creating an NSMutableArray with a literal via muta   

        
        
    查看全部
                 收藏的人(5)

Simplest way to extract first Unicode codepoint of

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间