owned this note
owned this note
Published
Linked with GitHub
---
title: String
type: slide
tags: 資芽簡報
---
# String
2023 資訊之芽北區 Py 班
感謝去年講師熊老師育霆讓我~~抄襲~~修改他的簡報
---
## Recall: what are strings?
我們來複習第一週教過的東西
----
一段文字
```python=
print(type("Hello, world!")) # <class 'str'>
```
----
可以用單引號或雙引號包起來
```python=
"I love Python" # 用雙引號包起來
'I love Python' # 用單引號包起來
```
----
加法(串接)、乘法(重複接)
```python=
name = "Madoka"
print("Hello, " + name) # Hello, Madoka
print(name * 3) # MadokaMadokaMadoka
```
----
如果要用引號怎麼辦
```python=
print('Hello. It's me.')
```
```
File "<stdin>", line 1
print('Hello. It's me.')
^^^^^^^^^^^^
SyntaxError: invalid syntax. Perhaps you forgot a comma?
```
----
跳脫字元,例如 `\n`, `\'`
```python=
print('Hello. It\'s me.') # Hello. It's me.
print('Never gonna give you up', 'Never gonna let you down', 'Never gonna run around and desert you', sep='\n')
"""Never gonna give you up
Never gonna let you down
Never gonna run around and desert you
"""
```
----
多行字串
```python=
print("""這幾行
會被
印出來 :)""")
```
----
f-string
```python=
name = "Kira Yoshikage"
age = 33
print("name=", name, "age=", age) # name= Kira Yoshikage age= 33
print(f"name={name} age={age}") # name=Kira Yoshikage age=33
print(f"{name=} {age=}") # name='Kira Yoshikage' age=33
```
---
## 名詞解釋:字元 char
字元:一個字
```python=
'a'
'b'
'c'
```
字串:由一串字元組成
```python=
'a' + 'b' + 'c'
```
----
## 補充:Python 沒有 char
在 C, C++, Java 等程式語言中有 char (字元) 資料型別,但在 Python 中沒有 char 資料型別,只有長度為 1 的字串。
```c=
char c = 'a'; // C and C++
```
```python=
c = 'a' # Python
# is equivalent to
c = "a"
print(type(c))
```
---
## 跟 `list` 差不多一樣的地方
在許多語言,string 就是個 list of chars
但 Python 是個特例
----
### 加法和乘法運算
加法(串接)、乘法(重複接)
```python=
name = "Madoka"
print("Hello, " + name) # Hello, Madoka
print(name * 3) # MadokaMadokaMadoka
```
回憶一下 `list` 的對應操作
```python=
print(['c'] + [8, 7] + [6, 3]) # ['c', 8, 7, 6, 3]
print([8, 7] * 3) # [8, 7, 8, 7, 8, 7]
```
----
### 比較運算
兩字串是否相等
```python=
print("abc" == "abc") # True
print("abc" != "abc") # False
```
比較兩字串字典序
```python=
print("abc" < "abd") # True
print("abc" < "abcd") # True
print("abc" > "def") # False
```
[字典序](https://en.wikipedia.org/wiki/Alphabetical_order#Ordering_in_the_Latin_script)
簡單來說就是從頭比到尾、都一樣比長度
----
### `in`, `not in`
```python=
text = 'abc'
print('a' in text) # True
print('ab' in text) # True
print('ac' in text) # False
print('ac' not in text) # True
```
回憶一下 `list` 的對應操作
```python=
list1 = ['a', 'b', 'c']
print('a' in list1) # True
print('ab' in list1) # False <---- 注意這個
```
問題:怎麼比較 `'a', 'b'` 連續地出現在 list 中?
----
### `len`
```python=
greeting = "Hello"
print(greeting)
print(len(greeting)) # 5
```
----
### 取第 i 個
```python=
greeting = "Hello"
print(greeting[0]) # H
print(greeting[1]) # e
print(greeting[3]) # l
print(greeting[-1]) # o
print(greeting[5]) # IndexError: string index out of range
```
這跟 `greeting = ['H', 'e', 'l', 'l', 'o']` 的差別是?
----
### Slicing
```python=
greeting = "Hello"
print(greeting[2:4]) # ll
print(greeting[:3]) # Hel
print(greeting[2:]) # llo
print(greeting[::-1]) # olleH
print(greeting[:-3:-1]) # ol
```
這跟 `greeting = ['H', 'e', 'l', 'l', 'o']` 的差別是?
----
### 可迭代 Iterable
如果你忘記什麼是 iterable,就是一句話:可以 for 它
```python=
greeting = "Hello"
for c in greeting:
print(c)
```
---
## 小補充 iterable
[還不錯的教學文件](https://www.pythonlikeyoumeanit.com/Module2_EssentialsOfPython/Iterables.html)
我們來看一下 definition 和一些花式用法
---
## 跟 list 不一樣的地方
----
### Immutable 不能改值
```python=
text = "abcde"
text[2] = 'f'
# TypeError: 'str' object does not support item assignment
```
----
如果要改掉某個字的話只能很麻煩地這樣做
```python=
text = "abcde"
new_text = text[:2] + "f" + text[3:]
print(new_text) # abfde
```
----
遠超過範圍的補充:[為什麼 string 是 immutable](https://stackoverflow.com/questions/8680080/why-are-python-strings-immutable-best-practices-for-using-them)
----
沒有 immutability 的話可能會發生一些怪事
```python=
a = [8, 7, 6, 3]
b = a
a[2] = 5
print(f"{a=}") # a=[8, 7, 5, 3]
print(f"{b=}") # b=[8, 7, 5, 3]
# Remark: use id
print(f"{id(a)=}")
print(f"{id(b)=}")
```
回憶一下為何我們會看到 a, b 會同時被改值
這是因為他們 reference 到同一個物件
----
[Why is immutability so important (or needed) in JavaScript?](https://stackoverflow.com/questions/34385243/why-is-immutability-so-important-or-needed-in-javascript)
雖然不是 Python 但是核心精神差不多,可以參考看看
----
課間練習:
輸入一行字,輸出第 $i$ 行把第 $i$ 個字元取代成 `*` 後印出
Input:
```txt
abcdefg
```
Output:
```txt
*bcdefg
a*cdefg
ab*defg
abc*efg
abcd*fg
abcde*g
abcdef*
```
----
範例解答
```python=
text = input()
for i in range(len(text)):
print(text[:i] + '*' + text[i+1:])
```
---
## String 的各種操作 (method)
----
String 的操作有[這麼多](https://www.w3schools.com/python/python_ref_string.asp),我們只能帶到幾個常用的
----
如果你想知道有沒有什麼簡單的方法達成你想要做的事情,可以試著搜尋看看
- Python turn string uppercase (把每個字元轉成大寫)
- Python string first capital (把第一個字轉成大寫)
- Python string remove blank (把空白拿掉)
---
### replace
就是搜尋與取代
```python=
lyrics = """Never gonna give you up
Never gonna let you down
Never gonna run around and desert you
"""
modified_lyrics = lyrics.replace("Never", "Always")
print(modified_lyrics)
print(lyrics)
```
注意:原本的字串不會被改動
----
練習一下
1. 把前面範例中的 "gonna" 取代成 "wanna"
2. 把前面範例中的 "gonna" 取代成 "wanna",並且把 "you" 取代成 "me"
----
範例解答
```python=
new_lyrics = lyrics.replace("gonna", "wanna")
```
```python=
new_lyrics = lyrics.replace("gonna", "wanna").replace("you", "me")
```
---
### isdigit & isalpha & isalnum
判斷字串中是否全為數字/字母/數字與字母
----
```python=
print("119".isdigit()) # True
print("Hello".isalpha()) # True
print("Hello, it's me".isalpha()) # False
print("C8763".isalnum()) # True
```
當然還有很多其他的 is 家族,可以慢慢查
----
當然,我們可以不用 `isdigit` 暴力做到這些事情
```python=
s = "271828"
flag = True
for ch in s:
if not "0" <= ch <= "9":
# if ch not in "0123456789":
flag = False
print(flag)
```
但這樣很累欸
----
練習時間
搭配 `isdigit` 和 `len` 判斷一串數字是否為合法的手機號碼
1. 十個數字
2. 前兩位是 09
```python=
text = input()
# 請開始你的練習
```
----
範例解答
```python=
text = input()
print(len(text) == 10 and text[:2] == "09" and text.isdigit())
```
---
### strip
```python=
new_text = " aaaaa ".strip()
print(f"{new_text=}") # new_text='aaaaa'
```
把字串頭尾的空白拿掉,並且回傳新的字串
---
### lower & upper
將字串中的英文字母全變成小寫或大寫
```python=
print("Ko no Dio da!".upper()) # KO NO DIO DA!
print("Ko no Dio da!".lower()) # ko no dio da!
```
----
一些使用情境:不分大小寫搜尋某個字串,例如說
```python=
want_to_search = "Iso"
text = "Isometry, isomorphism, isolated"
text.lower().find(want_to_search.lower()) # 下一小節是 find
```
這可以搭配後面的其他東西做練習
---
### find
尋找
```python=
lyrics = """Never gonna give you up"""
print(lyrics.find("gonna")) # 6
```
----
用法
```python=
string.find(value, start, end)
```
* value: 欲尋找的東東
* start: 從哪裡開始找,預設是 0
* end: 找到哪裡結束,預設是到字串結尾
----
把所有 gonna 找到
```python=
lyrics = """Never gonna give you up
Never gonna let you down
Never gonna run around and desert you
"""
positions = []
start = 0
value = "gonna"
while True:
ret = lyrics.find(value, start)
if ret == -1:
break
positions.append(ret)
start = ret + len(value)
print(positions)
```
---
### split
把字串切成一堆字串用 list 包起來
```python=
text = "Never gonna give you up"
splitted = text.split()
print(splitted) # ['Never', 'gonna', 'give', 'you', 'up']
```
----
範例:計算你輸入了幾個英文單字
```python=
words = input().split()
print(len(words))
```
----
範例:計算一堆數字的平均
```python=
numbers = list(map(float, input().split()))
print(sum(numbers) / len(numbers))
```
----
更詳細的用法
```python=
string.split(separator, maxsplit)
```
`separator`: 用什麼來切,預設是空白
`maxsplit`: 最多切幾個
----
```python=
a = "1 2 3 4 5".split()
b = "1 2 3 4 5".split(' ')
c = "1 2 3 4 5".split('3')
print(f"{a=}\n{b=}\n{c=}")
```
`split()` 和 `split(' ')` 其實有一些微妙的差異,細節需要注意一下
----
這樣會噴錯,為什麼?
```python
d = list(map(int, "1 2 3 4 5".split(' ')))
```
```python
ValueError Traceback (most recent call last)
/tmp/ipykernel_409726/410874544.py in <module>
----> 1 d = list(map(int, "1 2 3 4 5".split(' ')))
ValueError: invalid literal for int() with base 10: ''
```
因為空字串不能被轉成 `int`
----
下面這個就不會出事
```python
d = list(map(int, "1 2 3 4 5".split()))
```
---
### join
把一堆用 list 包起來的字串合成一個字串
```python=
words = ['Never', 'gonna', 'give', 'you', 'up']
print(' '.join(words)) # Never gonna give you up
```
就是把 split 反過來
----
用法
```python
string.join(iterable)
```
用 `string` 把 `iterable` 裡面的東西全部接起來
`iterable` 裡面必須是一些字串
----
```python=
''.join(['1', '2', '3']) # 123
''.join(map(str, range(10))) # 123456789
', '.join(map(str, range(10))) # 0, 1, 2, 3, 4, 5, 6, 7, 8, 9
''.join([1, 2, 3]) # 會噴錯,為什麼?
```
---
## 綜合練習
----
計算一個英文句子中有幾個字
例如 "Hello, it's me." 有三個, "Never gonna give you up" 有五個字
----
判斷一個字是否是回文
例如 "ada", "racecar" 是,"sprout" 不是
----
你有另一個不打標點符號的檳友
總是用一些空白代替逗號,用 XD 代替句號
請你用 Python 幫他改成正確的標點符號
```python=
s = '我..熬夜...做..簡報.. 累死..了...XD' \
'熬夜..真的..會...變智障 我該..去睡覺XD'
# Ans: '我熬夜做簡報,累死了。熬夜真的會變智障,我該去睡覺。'
```
感謝竹區講師 [Sean64](https://hackmd.io/@Sean64/py-string#/5) 想出的優質題目
---
## 作業
[3109](https://neoj.sprout.tw/problem/3109/)
[3046](https://neoj.sprout.tw/problem/3046/)
---
End