python - Perceptual hash function for text
Does anyone know of a simple perceptual hash algorithm for text? I took a look at the pHash function ph_texthash,
but I want a simpler algorithm, preferably in Python. Thanks!
A blog post on perceptual hash functions (in an imaging context):
And related Python code (dealing with images, not text, but it may be adaptable):
- http://sprunge.us/wcvj?py (53 loc)
As I understand from a short presentation on perceptual hashing of textual content, there are numerous approaches (along different dimensions, such as the level of the text, a linguistic or statistical approach, the model chosen to represent the text, ...), and the right one will depend on the domain and the problems you are trying to solve.
You might also look at locality-sensitive hashing,
which is a method of performing probabilistic dimension reduction of high-dimensional data. The basic idea is to hash the input items so that similar items are mapped to the same buckets with high probability (the number of buckets being much smaller than the universe of possible input items).
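To make the bucket idea concrete, here is a minimal sketch of SimHash, one well-known locality-sensitive scheme for text. This is not what ph_texthash does, and it is only an illustration: the function names (simhash, hamming_distance), the choice of MD5 as the per-token hash, the plain word tokenizer and the 64-bit width are all my own assumptions. Texts that share most of their words tend to get fingerprints that differ in only a few bits, so you compare fingerprints by Hamming distance instead of testing for equality.

```python
import hashlib
import re


def _token_hash(token: str, bits: int) -> int:
    """Hash one token to a `bits`-wide integer (MD5 is an arbitrary choice here)."""
    digest = hashlib.md5(token.encode("utf-8")).digest()
    return int.from_bytes(digest[: bits // 8], "big")


def simhash(text: str, bits: int = 64) -> int:
    """Return a SimHash fingerprint of `text`.

    `bits` must be a multiple of 8 and at most 128 (the size of an MD5 digest).
    """
    # Naive tokenizer: lowercase words only. Shingles or n-grams also work.
    tokens = re.findall(r"\w+", text.lower())

    # One signed counter per bit position.
    counts = [0] * bits
    for token in tokens:
        h = _token_hash(token, bits)
        for i in range(bits):
            # Vote +1 if the token's hash has bit i set, otherwise -1.
            counts[i] += 1 if (h >> i) & 1 else -1

    # The fingerprint keeps the bits where the positive votes won.
    fingerprint = 0
    for i, count in enumerate(counts):
        if count > 0:
            fingerprint |= 1 << i
    return fingerprint


def hamming_distance(a: int, b: int) -> int:
    """Number of bit positions in which two fingerprints differ."""
    return bin(a ^ b).count("1")
```

To use it, compute simhash() for two texts and treat a small hamming_distance() between the results as "probably similar". The threshold is something you would have to tune on your own data. To get actual buckets, you can group fingerprints by a few selected bit ranges and only compare texts that land in the same group.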