Python库介绍
2018-08-01 本文已影响30人
ACphart
括号中的数字代表在GitHub上获得的star数量,里面为2018年8月的数据。
主要参照GItHub上vinta的awesome-python项目 vinta/awesome-python
机器学习 Machine Learning.
1. scikit-learn (29647*)
- The most popular Python library for Machine Learning.
2. XGboost (12944*)
- A scalable, portable, and distributed gradient boosting library.
自然语言处理 Natural Language Processing.
1. NLTK (6685*)
- A leading platform for building Python programs to work with human language data.
2. jieba (14511*)
- Chinese text segmentation.
3. gensim (7480*)
- Topic Modelling for Humans.
4. Pattern (6412*)
- A web mining module for the Python.
5. TextBlob (5335*)
- Providing a consistent API for diving into common NLP tasks.
6. spaCy (10178*)
- A library for industrial-strength natural language processing in Python and Cython.
7. SnowNLP (3344*)
- A library for processing Chinese text.
8. PyTorch-NLP (982*)
- Supporting Rapid Prototyping with a Toolkit .
推荐系统 Recommendation System.
1. Surprise (2043*)
- A scikit for building and analyzing recommender systems.
2. spotlight (1122*)
- Deep recommender models using PyTorch.
3. lightFM (1858*)
- A Python implementation of a number of popular recommendation algorithms.
4. annoy (3857*)
- Approximate Nearest Neighbors in C++/Python optimized for memory usage.
科学计算 Scientific Computing.
1. Numpy (7852*)
- A fundamental package for scientific computing with Python.
2. Sympy (5027*)
- A computer algebra system written in pure Python
3. Scipy (4755*)
- A Python-based ecosystem of open-source software for mathematics, science, and engineering.
4. statsmodels (3101*)
- Statistical modeling and econometrics in Python.
5. colour (570*)
- A colour science package implementing a comprehensive number of colour theory transformations and algorithms.
特殊格式处理 Specific Formats Processing.
1. tablib (3069*)
- A module for Tabular Datasets in XLS, CSV, JSON, YAML.
2. PyPDF2 (2003*)
- A library capable of splitting, merging and transforming PDF pages.
3. pdfminer (2978*)
- A tool for extracting information from PDF documents.
爬虫 Web Crawling.
1. Scrapy (28463*)
- A fast high-level screen scraping and web crawling framework.
2. Pyspider (11732*)
- A powerful spider system.
3. portia (6213*)
- Visual scraping for Scrapy.
4. MechanicalSoup (2785*)
- A Python library for automating interaction with websites.
计算机视觉 Computer Vision.
1. OpenCV (26580*)
- Open Source Computer Vision Library.
2. SimpleCV (2158*)
- An open source framework for building computer vision applications.
数据分析 Data Analysis.
1. Pandas (15335*)
- A library providing high-performance, easy-to-use data structures and data analysis tools.
2. blaze (2435*)
- NumPy and Pandas interface to Big Data.
3. orange3 (1395*)
- Data mining, data visualization, analysis and machine learning through visual programming or scripts.
数据可视化 Data Visualization.
1. Matplotlib (7652*)
- A Python 2D plotting library.
2. Seaborn (5122*)
- Statistical data visualization using Matplotlib.
3. bokeh (7957*)
- Interactive Web Plotting for Python.
4. Altair (2614*)
- Declarative statistical visualization library for Python.
5. pygal (1902*)
- A Python SVG Charts Creator.
6. bqplot (1984*)
- Interactive Plotting Library for the Jupyter Notebook.
7. pyQtGraph (1178*)
- Interactive and realtime 2D/3D/Image plotting and science/engineering widgets.
8. geoplotlib (656*)
- python toolbox for visualizing geographical data and making maps.
深度学习 Deep Learning.
1. caffe (25051*)
- A fast open framework for deep learning.
2. Keras (32096*)
- A high-level neural networks library and capable of running on top of either TensorFlow or Theano.
3. MXNet (14756*)
- A deep learning framework designed for both efficiency and flexibility.
4. TensorFlow (106421*)
- The most popular Deep Learning framework created by Google.
5. PyTorch (17515*)
- Tensors and Dynamic neural networks in Python with strong GPU acceleration.
6. Theano (8393*)
- A library for fast numerical computation.
7. SerpentAI (4458*)
- Game agent framework. Use any video game as a deep learning sandbox.
8. neupy (468*)
- Running and testing different Artificial Neural Networks algorithms.
地理位置 Geolocation.
1. geopy (1977)
- Python Geocoding Toolbox.
GUI
1. Tkinter
- Tkinter is Python's de-facto standard GUI package.
2. PyQt
- Python bindings for the Qt cross-platform application and UI framework, with support for both Qt v4 and Qt v5 frameworks.
3. Flexx (1666*)
- Flexx is a pure Python toolkit for creating GUI's, that uses web technology for its rendering.
游戏开发 Game Development.
1. Cocos2d (460*)
- graphic library for games and multimedia, for python language on PC-like hardware
1. Cocos2d-x (12197*)
- Cocos2d-x is a suite of open-source, cross-platform, game-development tools used by millions of developers all over the world.
2. Panda3D (1162*)
- 3D game engine developed by Disney and maintained by Carnegie Mellon's Entertainment Technology Center. Written in C++, completely wrapped in Python.
3. Pygame (524*)
- Pygame is a set of Python modules designed for writing games.
4. Cocos2d-html5 (2418*)
- Cocos2d-html5 is a cross-platform 2D game engine written in JavaScript, based on Cocos2d-X and licensed under MIT.
图像处理 Image Processing.
1. pillow (5164*)
- Pillow is the friendly PIL fork.
2. thumbor (6054*)
- A smart imaging service. It enables on-demand crop, re-sizing and flipping of images.
3. scikit-image (2427*)
- A Python library for (scientific) image processing.
文本处理 Text Processing.
1. ftfy (2264*)
- Makes Unicode text less broken and more consistent automagically.
2. fuzzywuzzy (4471*)
- Fuzzy String Matching.
3. Pypinyin (1127*)
- Convert Chinese hanzi to pinyin.
4. textdistance (1200*)
- Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
音频处理类 Audio Manipulating.
1. eyeD3 (98*)
- eyeD3 is a Python tool for working with audio files, specifically MP3 files containing ID3 metadata.
2. pydub (2733*)
- Manipulate audio with a simple and easy high level interface.
3. dejavu (3946*)
- Audio fingerprinting and recognition.
4. pyAudioAnalysis (1668*)
- Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications.
5. m3u8 (429*)
- A module for parsing m3u8 file.