ИСТИНА |
Войти в систему Регистрация |
|
ИСТИНА ИНХС РАН |
||
A data processing method, apparatus and equipment, the method comprising: building a first data collection according to a similarity threshold j and target data, the target data including T1 first bit sets, the first data collection including M1 first data, the M1 first data corresponding one-to-one to M1 combinations of selecting j first bit sets from among the T1 first bit sets; generating N second data collections according to j and N stored data; the N stored data corresponding one-to-one to the N second data collections, each stored data including T2 second bit sets, each second data collection including M2 second data, each second data in the ith second data collection including the T2 second bit sets in the ith stored data, the M2 second data in the ith second data collection corresponding one-to-one to the M2 combinations of selecting j second bit sets from among the T2 second bit sets. Determining a first stored data from among N stored data according to the first data collection and second data collection may reduce the complexity of the lookup process of similar data.