数据结构 | 列表

2019-07-15 本文已影响0人简子逍

基本用法

创建数字列表

numbers = list(range(1,4))
print(numbers)  # [1, 2, 3]

访问列表中的值

car = ['audi', 'bmw', 'benchi', 'lingzhi']
print(car[0])          # audi
print(car[0].title())  # Audi
print(car[1:3])        # ['bmw', 'benchi']

删除列表中重复元素

功能：删除列表中重复出现的元素，并且保持剩下元素的显示顺序不变。

如果一个对象是可散列的（hashable），那么在它的生存期内必须是不可变的，这需要有一个__hash__方法。在 Python 程序中，整数、浮点数、字符串和元组都是不可变的。

a = 1
print(hash(a))  # 1
b = 1.0
print(hash(b))  # 1
c = 1.1
print(hash(c))  # 230584300921369601
d = "Hello"
print(hash(d))  # -2004459555708441633
e = (1, 2, (3, 4))
print(hash(e))  # -2725224101759650258
f = (1, 2, [3, 4])
print(hash(f))  # TypeError: unhashable type: 'list'
g = {'x':1}
print(hash(g))  # TypeError: unhashable type: 'dict'

如果序列中保存的元素是可散列的，那么上述功能可以使用集合和生成器实现：

def dedupe(items):
    seen = set()
    for item in items:
        if item not in seen:
            yield item
            seen.add(item)

if __name__ == '__main__':
    a = [5, 5, 2, 1, 9, 1, 5, 10]
    print(a)                # [5, 5, 2, 1, 9, 1, 5, 10]
    print(list(dedupe(a)))  # [5, 2, 1, 9, 10]

如果序列中元素是不可散列的，可以将序列中元素转换为可散列的类型：

def buha(items, key=None):
    seen = set()
    for item in items:
        val = item if key is None else key(item)
        if val not in seen:
            yield item
            seen.add(val)

if __name__ == '__main__':
    a = [
        {'x': 2, 'y': 3},
        {'x': 1, 'y': 4},
        {'x': 2, 'y': 3},
        {'x': 2, 'y': 3},
        {'x': 10, 'y': 15},
    ]
    print(a)                                             # [{'x': 2, 'y': 3}, {'x': 1, 'y': 4}, {'x': 2, 'y': 3}, {'x': 2, 'y': 3}, {'x': 10, 'y': 15}]
    print(list(buha(a, key=lambda a: (a['x'],a['y']))))  # [{'x': 2, 'y': 3}, {'x': 1, 'y': 4}, {'x': 10, 'y': 15}]

找出列表中次数最多的数

使用collections模块中的Counter类，调用Counter类中的函数most_common()来实现功能。

from collections import Counter

words = [
    'look', 'into', 'AAA', 'cdf', 'my', 'AAA',
    'the', 'AAA', 'into', 'the', 'my', 'MY',
    'BBB', 'AAA', 'look', 'BBB', 'not', 'the',
    'AAA','into', 'CCC','the'
]
word_counts = Counter(words)
top_three = word_counts.most_common(3)
print(top_three)  # [('AAA', 5), ('the', 4), ('into', 3)]

排序类定义的实例

使用内置函数sorted()可以接收一个用来传递可调用对象的参数key，而这个可调用对象会返回待排序对象中的某些值，sorted函数则利用这些值来比较对象。

class User:
    def __init__(self, user_id):
        self.user_id = user_id

    def __repr__(self):
        return 'User({})'.format(self.user_id)

# 原来的顺序
users = [User(19), User(17), User(18)]
print(users)                                     # [User(19), User(17), User(18)]

# 根据 user_id 排序
print(sorted(users, key=lambda u: u.user_id))    # [User(17), User(18), User(19)]
# 使用内置函数 operator.attrgetter()
from operator import attrgetter
print(sorted(users, key=attrgetter('user_id')))  # [User(17), User(18), User(19)]

列表推导式

语法格式：variable = [out_exp_res for out_exp in input_list if out_exp == 2]

获取 30 以内能够整除 3 的整数，然后依次输出所获得的整数的平方：

# 传统方式
numbers = []
for x in range(30):
    if x % 3 == 0:
        numbers.append(x*x)
print(numbers)    # [0, 9, 36, 81, 144, 225, 324, 441, 576, 729]

# 列表推导式
multiples = [i*i for i in range(30) if i % 3 is 0]
print(multiples)  # [0, 9, 36, 81, 144, 225, 324, 441, 576, 729]

filter函数把处理筛选功能的代码放到单独的功能函数中，用于筛选过程涉及异常处理或者其他一些复杂细节的情况。

def is_int(val):
    try:
        x = int(val)
        return True
    except ValueError:
        return False

values = ['1', '2', '-3', '-', '4', 'N/A', '5']
ivals = list(filter(is_int, values))
print(ivals)  # ['1', '2', '-3', '4', '5']