3.4 – Data Dictionary

Our data extraction methodology ensures a robust collection of structured information, with more
than 100 data points unified into a single dataset.

The resulting dataset covers key details across multiple job categories, locations, and
job specifics. It lets users analyze employment trends, salary ranges, job
types, and other facets of available job postings.

We tailor job datasets from 50+ job boards globally.

The following example illustrates a job posting from Indeed for a Filling Operator position at
“American Regent, Inc.” in the “Pharmaceutical and biotechnology” industry, based in New York,
NY. It includes details such as salary, job type, benefits, company information, and other relevant
job-specific attributes collected from Indeed job postings.

| No. | Field | Key | Type | Description |
|-----|-------|-----|------|-------------|
| 1 | Company Identifier | company_id | Unique Identifier | A unique identifier is assigned to each company. |
| 2 | City | city | Text | The city where the job is located. |
| 3 | Job Category | category | Text | The category or type of job (e.g., Filling Operator III). |
| 4 | Job Title | title | Text | The job title or position. |
| 5 | Job ID | job-id | Unique Identifier | A unique identifier is assigned to each job posting. |
| 6 | Company Name | company | Text | The name of the company offering the job. |
| 7 | Company Logo URL | company-logo | URL | URL of the company’s logo. |
| 8 | Company Rating | company rating | Numeric | The rating of the company. |
| 9 | Company Review Count | companyReviewCount | Numeric | The count of reviews for the company. |
| 10 | Company Overview Link | companyOverviewLink | URL | Link to the company’s overview. |
| 11 | Industry | industry | Text | The industry in which the company operates. |
| 12 | Salary Text | salary-txt | Text | Salary information provided in text format (e.g., “$. an hour”). |
| 13 | Salary Currency | salary-currency | Text | The currency used for salary figures. |
| 14 | Salary Maximum | salary-max | Numeric | The maximum salary offered for the job. |
| 15 | Salary Minimum | salary-min | Numeric | The minimum salary offered for the job. |
| 16 | Salary Frequency | salary-frequency | Text | Frequency of salary payment (e.g., hourly). |
| 17 | Location | location | Text | The specific job location (city, state). |
| 18 | Job URL | job-URL | URL | Web link to the job posting. |
| 19 | Post Date | post-date | Date | The date when the job was posted. |
| 20 | Post Date Timestamp | post-date-ts | Timestamp | Timestamp of the job posting. |
| 21 | Job Types | job-types | Text | Type of employment (e.g., Full-time). |
| 22 | Benefits | benefits | Text | Benefits offered for the job. |
| 23 | Schedules | schedules | Text | Job schedule details. |
| 24 | Source | source | Text | Source of the job data (e.g., Indeed). |
| 25 | Crawled Date | crawled-date | Date | Date when the job data was extracted. |
| 26 | Crawled Timestamp | crawled-ts | Timestamp | Timestamp of data extraction. |
| 27 | Job Status | active | Text | Indicates if the job posting is active. |
| 28 | Expiry Date | expiry-at | Date | Date of job expiry. |
| 29 | Post Graduate | post_graduate | Text | Indicates if the job requires a post-graduate degree. |
| 30 | EIN | ein | Text | Employer Identification Number. |
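
As an illustration of how the field keys and types listed above might be used, the following minimal Python sketch checks a single job record against a subset of the dictionary. It is not part of the delivered dataset or tooling. The function `validate_record`, the per-type checks, the ISO 8601 date format, and the rule that `salary-min` should not exceed `salary-max` are all assumptions made for this example, and the values in `sample` are made-up placeholders rather than real data.

```python
from datetime import date

# Subset of the data dictionary: field key -> declared type.
FIELD_TYPES = {
    "company_id": "Unique Identifier",
    "city": "Text",
    "title": "Text",
    "job-id": "Unique Identifier",
    "salary-currency": "Text",
    "salary-max": "Numeric",
    "salary-min": "Numeric",
    "salary-frequency": "Text",
    "job-URL": "URL",
    "post-date": "Date",
    "source": "Text",
}


def _is_iso_date(value):
    """Return True if value is an ISO 8601 date string (assumed format)."""
    try:
        date.fromisoformat(value)
        return True
    except (TypeError, ValueError):
        return False


# Assumed Python-side check for each declared type (hypothetical rules).
CHECKS = {
    "Unique Identifier": lambda v: isinstance(v, str) and v != "",
    "Text": lambda v: isinstance(v, str),
    "Numeric": lambda v: isinstance(v, (int, float)) and not isinstance(v, bool),
    "URL": lambda v: isinstance(v, str) and v.startswith(("http://", "https://")),
    "Date": _is_iso_date,
}


def validate_record(record):
    """Return a list of problems found in one job record (empty if none)."""
    problems = []
    for key, declared in FIELD_TYPES.items():
        if key not in record:
            problems.append(f"missing field: {key}")
        elif not CHECKS[declared](record[key]):
            problems.append(f"{key}: expected {declared}")
    # Assumed cross-field rule: the minimum salary should not exceed the maximum.
    lo, hi = record.get("salary-min"), record.get("salary-max")
    if CHECKS["Numeric"](lo) and CHECKS["Numeric"](hi) and lo > hi:
        problems.append("salary-min exceeds salary-max")
    return problems


# Placeholder record: identifiers, salary figures, URL, and date are made up.
sample = {
    "company_id": "c-001",
    "city": "New York",
    "title": "Filling Operator",
    "job-id": "j-001",
    "salary-currency": "USD",
    "salary-max": 25.0,
    "salary-min": 20.0,
    "salary-frequency": "hourly",
    "job-URL": "https://example.com/job/j-001",
    "post-date": "2024-01-15",
    "source": "Indeed",
}

print(validate_record(sample))
```

Returning a list of problems, rather than raising on the first one, lets a caller report every issue in a record at once. The remaining dictionary fields could be added to `FIELD_TYPES` in the same way.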