作者:请让我来打酱油 | 来源:互联网 | 2023-02-05 16:46
我想编写一个可以像这样使用的函数:
let ??? = "???".replacingFirstOccurrence(of: "", with: "")
鉴于这个字符串和Swift的String
库有多奇怪,这在Swift中是否可行?
1> Martin R..:
根据获得的见解为什么像????这样的表情符号字符在Swift字符串中处理得如此奇怪?,一种明智的方法可能是替换Unicode标量:
extension String {
func replacingFirstOccurrence(of target: UnicodeScalar, with replacement: UnicodeScalar) -> String {
let uc = self.unicodeScalars
guard let idx = uc.index(of: target) else { return self }
let prefix = uc[uc.startIndex..
例:
let family1 = "???"
print(family1.characters.map { Array(String($0).unicodeScalars) })
// [["\u{0001F469}", "\u{200D}"], ["\u{0001F469}", "\u{200D}"], ["\u{0001F467}", "\u{200D}"], ["\u{0001F466}"]]
let family2 = family1.replacingFirstOccurrence(of: "", with: "")
print(family2) // ???
print(family2.characters.map { Array(String($0).unicodeScalars) })
// [["\u{0001F469}", "\u{200D}"], ["\u{0001F469}", "\u{200D}"], ["\u{0001F466}", "\u{200D}"], ["\u{0001F466}"]]
这是一个可能的版本,它定位并替换任意字符串的Unicode标量:
extension String {
func replacingFirstOccurrence(of target: String, with replacement: String) -> String {
let uc = self.unicodeScalars
let tuc = target.unicodeScalars
// Target empty or too long:
if tuc.count == 0 || tuc.count > uc.count {
return self
}
// Current search position:
var pos = uc.startIndex
// Last possible position of `tuc` within `uc`:
let end = uc.index(uc.endIndex, offsetBy: tuc.count - 1)
// Locate first Unicode scalar
while let from = uc[pos..
@OlegGordiichuk:看看OP上一个问题http://stackoverflow.com/questions/43618487/why-is-treated-so-strangely-in-swift-strings,这正是这个问题.
2> xoudini..:
使用该range(of:options:range:locale:)
解决方案变得非常简洁:
extension String {
func replaceFirstOccurrence(of searchString: String, with replacementString: String) -> String {
guard let range = self.range(of: searchString, options: .literal) else { return self }
return self.replacingCharacters(in: range, with: replacementString)
}
}
这首先找到searchString
实例中的范围,如果找到范围,则替换为范围replacementString
.否则,实例只返回自身.并且,由于该range(of:)
方法在找到匹配后立即返回,因此保证返回的范围是第一次出现的.
"221".replaceFirstOccurrence(of: "2", with: "3") // 321
"???".replaceFirstOccurrence(of: "\u{1f469}", with: "\u{1f468}") // ???
*为了澄清,最后一个测试案例将女性 - 女性 - 女孩 - 男孩转变为男女女孩.
使用`.literal`选项确实优雅且更容易.
`.literal`被记录为"Exact character-by-character equivalence",但显然"character"并不意味着在这种情况下的Swift`Character`.我的猜测是它实际上意味着"精确的Unicode标量等价"或"精确的UTF-16等价"(因为`.literal`在`NSString.CompareOptions`中定义,而`NSString`基于`unichar`).