Email
Telegram

Kavunka Software Settings File

You can scale the search engine based on the capabilities of your hardware. From a virtual machine with one core and one gigabyte of RAM to a high-performance server or even a cluster. You can change the value of constants in the file "../kavunka/TFiles/config.t" constants.

SOFT_ID - to run multiple search engines on one server. Each such search engine should have its own unique SOFT_ID [0-9]
TIME_ZONE - server time zone
NUM_WORDS - number of words in the buffer
  • d = [0,1,2,3 ... FOLDERS_PER_SITE]
  • n = [0,1,2,3 ... NUM_ALL_WORDS/NUM_WORDS]
NUM_WORDS_SW - number of words in one file "/sfwords/n.t"
  • n = [0,1,2,3 ... MAX_HASH_WORD]
MAX_HASH_WORD - see NUM_WORDS_SW
MAX_DOMAINS_SRV - maximum number of sites in a search engine
MAX_URLS_SITE - maximum number of pages for one website
FOLDERS_PER_SITE - how many folders to distribute data from one website
MAX_TENTACLE - maximum number of tentacle-search for one octopus-search
NUM_ALL_WORDS - maximum number of words per website
N_BED_URLS - number of bad urls
N_URLS_FOR_SCAN - maximum number of lines (urls) in files "/sites/domain.com_d/urlsforscan.t" and "/sites/domain.com_d/urlspriorscan.t"
MAX_HASHES - maximum hashes of unique paragraphs
STATS_NOT_UNIQ_PARAG - take words from non-unique paragraphs?
  • 0 - not
  • 1 - yes
MIN_LEN_PARAG - minimum paragraph length for participating in word indexing
MAX_ALL_SITES_WORDS - dictionary size
MAX_OCTOPUS maximum number of octopuses that can work
SEARCH_TOP - maximum number of topers that can work (increases search quality)
MAX_KEYS - the maximum number of keys that are used to prompt users
MIN_WORDS_IN_KAY - minimum number of words per key
MAX_WORDS_IN_KAY - maximum number of words per key
KEYS_LEN_INDEX - maximum key length
KEYS_NUMBERS_INDEX - maximum number of keys for each character
KEYS_TOP_INDEX - how many keys should be displayed in the prompt to users
MAX_INDLINK - global index depth
KAVUNKA
Personal Search Engine
Powerful Crawlers
Fast WEB Scraping
Try for Free