python - Perceptual hash function for text -


does knows simple perceptual hash algorithm text ? took in phash function ph_texthash want more simple algorithm. preferably in python. thank !

a blog post perceptual hash functions (in imaging context):

and related python code (dealing images, not text, may adaptable):


as understand short presentation perceptual hashing of textual content, there numerous approaches (in different dimensions such level of text, linguistic or statistical approach, model chosen represent text, ...), , right 1 depend on domain , problems try solve.

also might locality-sensitive hashing,

is method of performing probabilistic dimension reduction of high-dimensional data. basic idea hash input items similar items mapped same buckets high probability (the number of buckets being smaller universe of possible input items)


Comments

Popular posts from this blog

c# - SharpSVN - How to get the previous revision? -

c++ - Is it possible to compile a VST on linux? -

url - Querystring manipulation of email Address in PHP -