Link Archive
It’s 2021 and so I don’t need to tell you that having your API pass a username and password through HTTP basic authentication is a bad idea. Your tokens should look large and random, whatever they are.
The Diátaxis framework aims to solve the problem of structure in technical documentation. It adopts a systematic approach to understanding the needs of documentation users in their cycle of interaction with a product.
The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks are usually required to build more advanced text processing services. OpenNLP also includes maximum entropy and perceptron based machine learning.
Group thousands of similar spreadsheet text cells in seconds
String matching in Python with TF-IDF and Cosine Similarity
Bag of Words: In its simplest form, BOW is a list of the distinct words in a document together with a count for each word. It is a simple model for representing text as a numerical structure. Consider the term “document” to mean any text you can access, regardless of format, from the text in a word-processor file to a standalone string variable. I leave it up to you to extract the text from whatever format you are working with.
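A minimal sketch of that idea, using only the standard library (the function name `bag_of_words` is mine, not from the linked article):

```python
from collections import Counter

def bag_of_words(document: str) -> Counter:
    # Lowercase and split on whitespace; a real tokenizer
    # would also strip punctuation and handle edge cases.
    return Counter(document.lower().split())

bow = bag_of_words("the cat sat on the mat")
# Counter({'the': 2, 'cat': 1, 'sat': 1, 'on': 1, 'mat': 1})
```

Each distinct word becomes a key and its count becomes the value, which is exactly the numerical structure BOW describes.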
SciPy offers a variety of sparse matrix formats that store only the non-zero elements, minimizing the memory required for data storage. Machine learning workflows often require the whole data frame to be in memory; compressing it into a sparse representation lets data that would otherwise exceed RAM fit comfortably. Performing operations on only the non-zero values of a sparse matrix can also greatly increase the execution speed of an algorithm.
Comparing very large feature vectors and picking the best matches, in practice, often comes down to performing a sparse matrix multiplication followed by selecting the top-n results. In this blog, we implement a customized Cython function for this purpose. Compared with doing the same using SciPy and NumPy functions, our approach improves speed by about 40% and reduces memory consumption. The GitHub code of our approach is available here.
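The Cython implementation itself is in the linked repository; the operation it accelerates can be sketched as a plain SciPy/NumPy baseline — multiply, then keep the n largest values per row (the function name `sparse_topn` is mine, and unlike the Cython version this materializes the full product first):

```python
import numpy as np
from scipy.sparse import csr_matrix

def sparse_topn(A, B, n):
    """Multiply sparse A and B, then keep the top-n values per row."""
    C = A.dot(B).tocsr()
    rows, cols, vals = [], [], []
    for i in range(C.shape[0]):
        start, end = C.indptr[i], C.indptr[i + 1]
        row_vals = C.data[start:end]
        row_cols = C.indices[start:end]
        if len(row_vals) > n:
            # argpartition finds the n largest without a full sort
            keep = np.argpartition(-row_vals, n - 1)[:n]
            row_vals, row_cols = row_vals[keep], row_cols[keep]
        vals.extend(row_vals)
        cols.extend(row_cols)
        rows.extend([i] * len(row_vals))
    return csr_matrix((vals, (rows, cols)), shape=C.shape)
```

The Cython approach's gain comes from fusing the two steps, discarding non-top-n values during the multiplication instead of storing the full intermediate result as this baseline does.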