Data Sources

Overview of Data Sources

Our data sources include social media and web scraping, enabling us to identify approximately 25,000 new entities per month, of which 5,000 qualify as venture fundable businesses (startups).

In total, we add around 300,000 new entities per year.

Data Collection Methodologies

  • Continuous Searching: We continuously scan the web and social media for new startup companies that qualify as venture fundable businesses.
  • Data Enrichment: Proprietary technology enriches data by adding topic categories and signal indicators.
  • Detailed Descriptions: We formulate comprehensive descriptions of each startup, detailing the problem they solve and their target customers.
  • Founder Information: The company record includes founder details.
  • Contact Information: Fresh email addresses are sourced directly from company websites, facilitating direct communication with founders.

Fun Fact: At most companies until the Series A funding stage, CEOs themselves often read "info@" emails.

Data Quality and Accuracy

Ensuring Data Integrity
  • Duplicate Elimination: Various methods, including unique identifiers, help eliminate about 98% of duplicates. However, edge cases exist where companies use different names or are incorporated in multiple countries.
  • Founder Identification: Occasionally, duplicates go unrecognized, especially when founders use different name spellings or omit last names.
  • Accuracy Algorithm: An algorithm selects or merges the most accurate information when multiple sources provide the same data.
Regular Updates and Maintenance
  • Update Cadence: Company data is updated regularly, with higher update frequencies for companies in demand by most of our customer base.