Leading researchers like Karl Friston —creating computational statistical models that minimize prediction-error. The human brain operates much the same way, also learning from data. A common argument goes: describe AI as "active inference" "AI will never be intelligent because it needs to see something thousands of times to learn it, while a human only needs to see something once." simply: " " As describes, "anyone on a mission to create a first-class AI-powered product needs vast amounts of data to feed to the machines," because multiple parameters and classes need to be learned. Others put it the more, the better. this Medium article However, we can outline a few ways to circumvent the need for big data in your AI journey: Transfer learning (including one-shot and zero-shot learning) Turn-key solutions (High quality) little data 1. Transfer Learning “Transfer learning is an up-and-coming technique that allows us to transfer the knowledge learned in one dataset and apply it to another dataset.” - Bradley Arsenault Transfer learning essentially takes learning from one domain and brings it to another, so you don't have to start from 0. This is especially useful in highly-specific domains with not much data available. To visualize it: This goes into the nuances of transfer learning. Besides not having to start from 0 with transfer learning, there are methods such as one-shot or even zero-shot learning that enable training models with minimal data. super in-depth guide One-shot learning is to infer required output based on just one or few training examples, as discussed in this paper: . ‘One Shot Learning of Object Categories’ Zero-shot learning is a more extreme version of the above, where no labelled examples are used to learn a task. 2. Turn-key solutions The second method of deploying AI with less data is by using turn-key solutions, that are already pre-trained on massive quantities of data. Here are just a few examples: Google Cloud AI "Google Cloud’s AI Hub provides enterprise-grade sharing capabilities, including end-to-end AI pipelines and out-of-the-box algorithms, that let your organization privately host AI content to foster reuse and collaboration among internal developers and users..." Microsoft Azure AI Platform "Only Azure empowers you with the most advanced machine learning capabilities. Quickly and easily build, train, and deploy your machine learning models using Azure Machine Learning, Azure Databricks and ONNX..." Amazon Machine Learning "AWS pre-trained AI Services provide ready-made intelligence for your applications and workflows. AI Services easily integrate with your applications to address common use cases such as personalized recommendations, modernizing your contact center, improving safety and security, and increasing customer engagement..." 3. Little data More data is not always , especially if that data is not labelled, not indicative of the problem at hand, or dirty. You might have millions of rows, but if it's messy data that's hardly relevant to the problem and only usable with unsupervised learning, then a smaller, highly-targeted, and clean data-set would be much better to have. better lists a few questions to decide between using little data and big data: This article Do you already have the data you need, and is it labeled? What’s your use case and what is the minimum data needed to address it? How advanced is your organization (really) when it comes to AI/ML? While big data is all the rage, it's not the only way to fuel ML models. would go even so far as to say that small data is the future of AI. Some A great discusses a few specific cases of using little data in the real-world. Harvard Business Review article For example, "researchers at Vicarious have developed a model that can at a far higher rate than deep neural networks and with 300-fold more data efficiency." Their model needed only training examples per character. break through CAPTCHAs five In conclusion, more data be better, and if you have it available, then great! However, if you don't, there are still ways to deploy AI in your organization. can

Amazon

Google

Microsoft

Super

Busting AI Myths: "You Need Tons of Data for Machine Learning"

About Author

Comments

TOPICS

THIS ARTICLE WAS FEATURED IN

Related Stories

14% of Americans Own Crypto. Only 12% Own Gold.

The Noonification: How Often Do NFTs Pass The Howey Test? (1/13/2023)

Darwin's Hybrid Intelligence to Align AI & Human Goals for Startups & VCs

The Noonification: White Man (11/26/2022)

The Noonification: The Metaverse is a Sh*tshow (11/2/2022)

100 Days of AI Day 1: From Newsletter to Podcast, Leveraging AI for Audio Transformation

14% of Americans Own Crypto. Only 12% Own Gold.

The Noonification: How Often Do NFTs Pass The Howey Test? (1/13/2023)

Darwin's Hybrid Intelligence to Align AI & Human Goals for Startups & VCs

The Noonification: White Man (11/26/2022)

The Noonification: The Metaverse is a Sh*tshow (11/2/2022)

100 Days of AI Day 1: From Newsletter to Podcast, Leveraging AI for Audio Transformation

Light-Mode

Classic

Newspaper

Minty

Dark-Mode

Neon Noir

Minty

HN StartUps