gensim ★ PINNED
Topic Modelling for Humans
Python ★ 16k 7mo ago
smart_open ★ PINNED
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Python ★ 3.4k 22d ago
bounter ★ PINNED
Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.
Python ★ 932 3y ago
word_embeddings ★ PINNED
Code for the blog post "Making Sense of Word2vec"
Python ★ 113 11y ago
sqlitedict ★ PINNED
Persistent dict, backed by sqlite3 and pickle, multithread-safe.
Python ★ 1.2k 3y ago
sim-shootout ★ PINNED
Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neighbours-intro
Python ★ 98 11y ago
gensim-data
Data repository for pretrained NLP models and NLP corpora.
Python ★ 1.1k 8y ago
topic_modeling_tutorial
Instructions & code for the EuroPython 2014 training session "Topic Modeling for Fun and Profit"
Python ★ 111 12y ago
gensim-simserver
[NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]
Python ★ 109 13y ago
data_science_python
Source code for the "Practical Data Science in Python" tutorial
★ 58 10y ago
sparsesvd
Python wrapper around SVDLIBC, a fast library for sparse Singular Value Decomposition
C ★ 55 13y ago
PredatorPrey
Boid flocking model in OpenGL
C++ ★ 12 9y ago
flann ⑂
Fast Library for Approximate Nearest Neighbors
C++ ★ 12 14y ago
pattern ⑂
Web mining module for Python
Python ★ 6 14y ago
pazpar2
No description.
C ★ 4 13y ago
VectRast
Bitmaps to vectors for Elastomania
C# ★ 4 11y ago
gensim-wheels
Repository to build and test Gensim wheels
Batchfile ★ 3 4y ago
engineering-blogs ⑂
No description.
★ 3 11y ago
indexed_gzip ⑂
Fast random access of gzip files in Python
C ★ 2 3y ago
kibi ⑂
Kibi is a friendly - kept in sync - Kibana fork which adds support for joins across indexes and external sources, tabbed navigation interface and more
JavaScript ★ 2 9y ago
MITIE ⑂
MITIE: library and tools for information extraction
C++ ★ 2 9y ago
gsoc ⑂
Numfocus Google Summer of Code Materials
★ 2 10y ago
spotlight ⑂
Deep recommender models using PyTorch.
Python ★ 2 8y ago
keras ⑂
Deep Learning for humans
Python ★ 2 7y ago
scikit-learn ⑂
temporary fix for https://github.com/scikit-learn/scikit-learn/issues/6186
Python ★ 1 10y ago
glove-python ⑂
Toy Python implementation of http://www-nlp.stanford.edu/projects/glove/
★ 1 10y ago
falcon ⑂
Chrome extension for full text history search!
JavaScript ★ 1 9y ago
fast-style-transfer ⑂
Fast Style Transfer in TensorFlow ⚡🖥🎨🖼
Python ★ 1 9y ago
generating-reviews-discovering-sentiment ⑂
Code for "Learning to Generate Reviews and Discovering Sentiment"
Python ★ 1 9y ago
hashrobot ⑂
A social media assistant.
HTML ★ 1 8y ago
DAWG ⑂
DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.
★ 1 3y ago
gunicorn ⑂
gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.
★ 1 2y ago
opencv-python ⑂
Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.
Shell ★ 1 1y ago
opencv_contrib ⑂
Repository for OpenCV's extra modules
★ 1 1y ago
pybloomfiltermmap ⑂
Fast Python Bloom Filter using Mmap
C ★ 1 10y ago
spark-ec2 ⑂
Scripts used to set up a Spark cluster on EC2
Shell ★ 1 10y ago
fuzzysearch-demo ⑂
Demo of fuzzy (nearest neighbor) search with dense data
Java ★ 1 10y ago
dexter ⑂
Dexter is a framework that implements some popular algorithms and provides all the tools needed to develop any entity linking technique.
Java ★ 1 11y ago
annoy ⑂
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
C++ ★ 1 12y ago
jusText ⑂
Heuristic based boilerplate removal tool
Python ★ 1 13y ago
askbot-devel ⑂
ASKBOT is a StackOverflow-like Q&A forum, based on CNPROG.
Python ★ 1 14y ago
redis-py ⑂
Redis Python Client
Python ★ 0 7y ago
xlrd ⑂
Please use openpyxl where you can...
★ 0 1mo ago
numfocus.org ⑂
NumFOCUS.org Website
CSS ★ 0 12y ago
line_profiler ⑂
Line-by-line profiling for Python
Python ★ 0 7y ago
gensim-feedstock ⑂
A conda-smithy repository for gensim.
Shell ★ 0 7y ago
ann-benchmarks ⑂
Benchmarks of approximate nearest neighbor libraries in Python
Python ★ 0 11y ago
spaCy ⑂
Industrial strength NLP with Python and Cython
Python ★ 0 11y ago
random-projections-at-berlinbuzzwords ⑂
Demo of random projections at BerlinBuzzwords 2015
Scala ★ 0 10y ago
DynamicPoissonFactorization ⑂
Dynamic version of Poisson Factorization (dPF). dPF captures the changing interest of users and the evolution of items over time according to user-item ratings.
C++ ★ 0 10y ago
castra ⑂
partitioned storage system based on blosc
Python ★ 0 10y ago
Kalman-and-Bayesian-Filters-in-Python ⑂
Kalman Filter textbook using IPython Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters, extended Kalman filters, unscented Kalman filters, particle filters, and more. All exercises include solutions.
Python ★ 0 11y ago
git-tst
No description.
Python ★ 0 11y ago
Conjecture ⑂
Scalable Machine Learning in Scalding
★ 0 11y ago
python-blosc ⑂
A Python wrapper for the extremely fast Blosc compression library
★ 0 12y ago
termite-data-server ⑂
Data Server for Topic Models
Python ★ 0 12y ago
statsmodels ⑂
Statsmodels: statistical modeling and econometrics in Python
Python ★ 0 12y ago
solr-vs-elasticsearch ⑂
The src for http://solr-vs-elasticsearch.com
PHP ★ 0 13y ago
pymarc ⑂
process MARC records from Python
Python ★ 0 13y ago