Showing posts with label training. Show all posts
Showing posts with label training. Show all posts

15 March 2026

🤖〽️Prompt Engineering: Copilot Unabridged (Part 138: Understanding Gender Disparities in AI Models and How to Minimize Them)

Prompt Engineering Series
Prompt Engineering Series

Prompt: "write a post of 600 words on the causes of gender disparities in AI models and how can their occurrence be minimized"

Introduction

Artificial Intelligence (AI) has become deeply embedded in modern life, shaping decisions in hiring, healthcare, education, finance, and everyday digital interactions. Yet as AI systems grow more influential, concerns about gender disparities in their outputs have become increasingly urgent. These disparities are not the result of malicious intent within the technology itself? - AI has no intentions - but rather the reflection of human choices, historical inequalities, and structural biases embedded in data and design. Understanding the causes of gender disparities in AI models is essential for building systems that are fair, trustworthy, and inclusive. Equally important is identifying strategies to minimize these disparities so that AI contributes to a more equitable future.

Where Gender Disparities Come From

1. Biased or Unbalanced Training Data

AI models learn from examples. If the data used to train them reflects gender imbalances, stereotypes, or historical discrimination, the model will absorb and reproduce those patterns. For example:

  • Datasets dominated by male subjects can lead to poorer performance on female subjects.
  • Text corpora containing gender‑stereotypical language can cause models to associate certain professions or traits with one gender.
  • Historical hiring or lending data may encode discriminatory practices.

When the data is skewed, the model’s behavior becomes skewed as well.

2. Underrepresentation in Data Collection

Some groups are simply less represented in the data. This can happen unintentionally - for example, medical datasets that include fewer women, or voice recognition systems trained primarily on male voices. Underrepresentation leads to poorer accuracy and reliability for those groups, reinforcing inequality.

3. Lack of Diversity in Development Teams

AI systems reflect the perspectives of the people who build them. When development teams lack gender diversity, blind spots can emerge. Certain use cases may be overlooked, certain harms underestimated, and certain assumptions left unchallenged. Diversity is not just a social value - it is a technical necessity for robust design.

4. Ambiguous or Biased Labeling Practices

Human annotators label data, and their judgments can introduce bias. For example, labeling images, categorizing emotions, or classifying behaviors can be influenced by cultural or gendered assumptions. If labeling guidelines are unclear or inconsistent, bias becomes baked into the dataset.

5. Reinforcement of Societal Patterns

AI models often mirror the world as it is, not as it should be. If society exhibits gender disparities in pay, leadership roles, or representation, AI systems trained on real‑world data may reinforce those disparities. Without intervention, AI becomes a feedback loop that amplifies inequality.

How Gender Disparities Can Be Minimized

1. Improve Data Quality and Representation

  • Balanced, diverse, and carefully curated datasets are essential. This includes:
  • Ensuring representation across genders
  • Auditing datasets for skewed distributions
  • Removing or mitigating harmful stereotypes

Better data leads to better outcomes.

2. Use Bias Detection and Fairness Tools

Modern AI development includes tools that can:

  • Detect gender‑based performance gaps
  • Flag biased associations
  • Evaluate fairness across demographic groups

Regular auditing helps identify problems early.

3. Increase Diversity in AI Teams

A broader range of perspectives leads to more thoughtful design. Gender‑diverse teams are more likely to:

  • Spot potential harms
  • Challenge assumptions
  • Build inclusive systems

Diversity strengthens both ethics and performance.

4. Establish Clear Ethical Guidelines

Organizations should adopt frameworks that prioritize fairness, transparency, and accountability. Ethical guidelines help ensure that gender considerations are not an afterthought but a core design principle.

5. Encourage Human Oversight

AI should support - not replace - human judgment. Human review can catch issues that automated systems miss, especially in sensitive areas like hiring, healthcare, or legal decisions.

Closing Statement

Gender disparities in AI models are not inevitable; they are the result of human choices, historical patterns, and structural imbalances. By understanding their causes and taking deliberate steps to address them, we can build AI systems that are more fair, more accurate, and more aligned with the values of an inclusive society. The path forward requires better data, diverse teams, ethical design, and ongoing vigilance. With thoughtful effort, AI can become not a mirror of past inequalities, but a tool for a more equitable future.

Disclaimer: The whole text was generated by Copilot (under Windows 11) at the first attempt. This is just an experiment to evaluate feature's ability to answer standard general questions, independently on whether they are correctly or incorrectly posed. Moreover, the answers may reflect hallucinations and other types of inconsistent or incorrect reasoning.

Previous Post <<||>> Next Post

03 March 2026

🤖〽️Prompt Engineering: Copilot Unabridged (Part 127: Understanding Facts in Modern AI)

Prompt Engineering Series
Prompt Engineering Series


Prompt: "write a post of 600 words on what is meant by facts in nowadays AI and include an introduction, respectively a closing statement"

Introduction

As Artificial Intelligence (AI) becomes more deeply integrated into communication, research, and decision‑making, the question of what AI considers a fact has become increasingly important. People often assume that if an AI states something confidently, it must be true. But AI systems do not possess a stable internal database of verified truths. Instead, they generate responses by predicting what information is most likely to follow from patterns in the data they were trained on. Understanding how facts function in modern AI helps clarify why these systems can be powerful tools - and why they sometimes produce errors or fabrications.

What a 'Fact' Means for Humans

For humans, a fact is a statement that can be verified through observation, evidence, or reliable sources. Facts are:

  • Stable: they do not change depending on context.
  • Grounded: they refer to real‑world states or events.
  • Verifiable: they can be checked against evidence.
  • Independent: they exist whether or not someone remembers them.

Human understanding of facts is tied to reasoning, experience, and shared standards of truth.

How AI Models Handle Facts

AI systems do not have beliefs, memories, or understanding. They work by identifying statistical patterns in massive datasets. This leads to a different relationship with facts:

  • Facts are patterns: not stored entries but tendencies in the data.
  • Facts are probabilistic: the model generates what seems likely, not what is verified.
  • Facts are context‑sensitive: the same question phrased differently may yield different answers.
  • Facts are not inherently distinguished from non‑facts: the model does not “know” what is true; it only predicts what fits the pattern.

This is why AI can produce accurate information in one moment and incorrect information in another.

The Fragility of AI Facts

Because AI relies on statistical inference, several factors can distort factual accuracy:

  • Training data limitations: if the data is outdated, incomplete, or biased, the model’s 'facts' reflect those flaws.
  • Ambiguous prompts: unclear questions can lead to confident but incorrect answers.
  • Lack of real‑time grounding: unless connected to external sources, AI cannot update facts after training.
  • Hallucinations: the model may generate plausible‑sounding but false statements when patterns are weak or conflicting.

These issues highlight that AI does not know facts; it reconstructs them.

Why AI Can Still Be Factually Useful

Despite these limitations, AI can be highly effective at working with factual information when used appropriately. Its strengths include:

  • Synthesizing large volumes of data: AI can integrate information from many sources at once.
  • Recognizing factual patterns: it can identify common knowledge across diverse texts.
  • Retrieving structured information: when connected to verified databases or tools, it can provide up‑to‑date facts. 
  • Supporting human fact‑checking: AI can surface relevant details quickly, which humans can then verify.

In this sense, AI acts as a fact assistant, not a fact authority.

The Human Role in Defining Facts for AI

Because AI cannot distinguish truth from falsehood on its own, humans play a crucial role in shaping factual accuracy:

  • Curating training data: selecting high‑quality, diverse, and reliable sources.
  • Building guardrails : designing systems that avoid unsupported claims.
  • Providing feedback: correcting errors to improve future performance.
  • Maintaining oversight: verifying outputs before relying on them for decisions.

AI becomes more reliable when humans treat it as a collaborator rather than an oracle.

Closing Statement

Facts in modern AI are not fixed truths stored inside a machine but statistical echoes of the data used to train it. Understanding this distinction helps set realistic expectations: AI can be a powerful tool for accessing and organizing information, but it cannot replace human judgment, verification, or critical thinking. As AI continues to evolve, the challenge is to build systems that handle facts responsibly - and to ensure that humans remain the final arbiters of truth.

Disclaimer: The whole text was generated by Copilot (under Windows 11) at the first attempt. This is just an experiment to evaluate feature's ability to answer standard general questions, independently on whether they are correctly or incorrectly posed. Moreover, the answers may reflect hallucinations and other types of inconsistent or incorrect reasoning.

Previous Post <<||>> Next Post

02 March 2026

🤖〽️Prompt Engineering: Copilot Unabridged (Part 126: Understanding Knowledge in Modern AI)

Prompt Engineering Series
Prompt Engineering Series


Prompt: "write a post of 600 words on what is meant by intelligence in nowadays AI and include an introduction, respectively a closing statement"

Introduction

In recent years, Artificial Intelligence (AI) has moved from a niche research field into a central force shaping society, industry, and everyday life. As AI systems become more capable, a key question emerges: What does 'knowledge' actually mean for these systems? When we say an AI 'knows' something, we are using a metaphor that can be misleading if taken too literally. Understanding the nature of AI knowledge helps clarify both the power and the limits of these technologies - and helps us use them responsibly.

How AI Represents Knowledge

Modern AI systems, especially large language models, do not store knowledge as humans do. Instead of memories, concepts, or experiences, they rely on patterns in data. These patterns are encoded in mathematical structures - billions of parameters that capture statistical relationships between words, images, or other inputs.

Three characteristics define this form of knowledge:

  • Statistical rather than experiential: AI does not learn through lived experience but through exposure to vast datasets. It identifies correlations, not meanings.
  • Implicit rather than explicit:Knowledge is not stored as facts in a database but as distributed weights across a neural network.
  • Generalized rather than specific: AI does not recall exact documents unless explicitly designed to do so; it generates responses by predicting what is likely based on learned patterns.

This means AI 'knowledge' is powerful for pattern recognition and language generation but does not involve understanding, consciousness, or subjective awareness.

The Role of Training Data

AI knowledge is shaped by the data it is trained on. This has several implications:

  • Breadth: AI can integrate information from millions of sources, far beyond human capacity.
  • Bias: If the data contains biases, stereotypes, or inaccuracies, the model may reproduce them.
  • Temporal limits: AI knowledge reflects the state of the world at the time of training; without updates, it becomes outdated.

Because of this, AI knowledge is always a snapshot - comprehensive but not timeless.

Knowledge as Capability

In practice, AI knowledge is best understood as capability: the ability to perform tasks that require information, such as summarizing texts, generating ideas, answering questions, or analyzing patterns. This capability emerges from:

  • Language modeling: predicting the next word in a sequence.
  • Representation learning: capturing abstract relationships between concepts.
  • Reasoning heuristics: approximating logical or analytical processes.

These capabilities can resemble human reasoning, but they are fundamentally different in origin and mechanism.

The Human–AI Knowledge Gap

Humans and AI differ not only in how they store knowledge but also in how they use it:

  • Humans rely on context, emotion, intuition, and lived experience.
  • AI relies on probabilities, patterns, and statistical inference.

This gap is why AI can excel at tasks like translation or summarization yet struggle with ambiguity, ethics, or common sense. It also explains why AI can generate confident but incorrect statements - a phenomenon often called 'hallucination'.

Why Understanding AI Knowledge Matters

Recognizing what AI does and does not 'know' helps society make better decisions about its use. It encourages:

  • Critical thinking: treating AI outputs as suggestions, not truths.
  • Responsible deployment: using AI where pattern recognition is useful, not where moral judgment is required.
  • Better design: building systems that combine AI’s strengths with human oversight.

Closing Statement

Knowledge in today’s AI is not a mirror of human understanding but a new, powerful form of statistical representation. Appreciating this difference allows us to harness AI’s capabilities while remaining aware of its limitations. As AI continues to evolve, our challenge is not to make machines think like humans, but to integrate their unique form of knowledge into society in ways that are safe, transparent, and beneficial.

Disclaimer: The whole text was generated by Copilot (under Windows 11) at the first attempt. This is just an experiment to evaluate feature's ability to answer standard general questions, independently on whether they are correctly or incorrectly posed. Moreover, the answers may reflect hallucinations and other types of inconsistent or incorrect reasoning.

Previous Post <<||>> Next Post

05 January 2026

🤖〽️Prompt Engineering: Copilot Unabridged (Part 124: The Impact of Algorithmic Bias on AI: Why It Matters More Than Ever)

Prompt Engineering Series
Prompt Engineering Series

Prompt: "write a blog post of 600 words on the impact of algorithmic bias on AI"

Introduction

Artificial intelligence (AI) has become woven into the fabric of modern life. It recommends what we watch, filters what we read, evaluates our job applications, approves our loans, and increasingly shapes decisions once made exclusively by humans. Yet beneath the surface of this technological progress lies a persistent and often invisible challenge: algorithmic bias. As AI systems grow more influential, understanding the origins, consequences, and solutions to bias becomes essential - not just for technologists, but for anyone living in a world mediated by algorithms.

What Is Algorithmic Bias?

Algorithmic bias occurs when an AI system produces systematically unfair outcomes - favoring or disadvantaging certain groups - because of patterns embedded in its data, design, or deployment. Contrary to popular belief, bias in AI is rarely the result of malicious intent. Instead, it emerges from the simple fact that AI learns from historical data, and history is full of human imperfections.

If the data reflects societal inequalities, the model will learn those inequalities. If the training set underrepresents certain populations, the model will perform worse for them. And if the objectives or constraints are poorly defined, the system may optimize for the wrong outcomes entirely.

In other words, AI doesn’t just mirror the world - it can magnify its flaws.

Where Bias Creeps In

Bias can enter an AI system at multiple stages:

1. Biased Training Data

AI models learn statistical patterns from examples. If those examples are skewed, incomplete, or unrepresentative, the model inherits those distortions. Classic cases include facial recognition systems that perform poorly on darker skin tones because the training data was overwhelmingly composed of lighter-skinned faces.

2. Problem Framing and Design Choices

Even before data enters the picture, human decisions shape the system. What is the model optimizing for? What counts as a 'successful' prediction? Which variables are included or excluded? These choices embed assumptions that can unintentionally privilege certain outcomes.

3. Feedback Loops in Deployment

Once deployed, AI systems can reinforce their own biases. For example, predictive policing tools may direct more patrols to neighborhoods flagged as 'high risk', generating more recorded incidents and further validating the model’s initial assumptions - even if the underlying crime rates were similar elsewhere.

Why Algorithmic Bias Matters

The consequences of biased AI are not abstract - they affect real people in tangible ways.

1. Inequitable Access and Opportunity

Biased hiring algorithms may filter out qualified candidates. Loan approval models may disproportionately reject applicants from certain demographics. Healthcare algorithms may underestimate risk for specific populations, leading to worse outcomes.

2. Erosion of Trust

When people experience or perceive unfairness in algorithmic decisions, trust in institutions erodes. This is especially damaging in sectors like government, healthcare, and finance, where legitimacy depends on perceived fairness.

3. Amplification of Social Inequalities

AI systems operate at scale. A biased decision made once by a human affects one person; a biased decision made by an algorithm affects thousands or millions. Without intervention, AI can accelerate and entrench existing disparities.

Toward Fairer, More Responsible AI

Addressing algorithmic bias is not a one-time fix - it’s an ongoing process requiring technical, organizational, and societal commitment.

1. Better Data Practices

Diverse, representative datasets reduce the risk of skewed outcomes. Equally important is transparency about data sources and limitations.

2. Rigorous Testing and Auditing

Bias detection tools, fairness metrics, and independent audits help identify issues before they cause harm. Continuous monitoring ensures that systems remain fair as conditions change.

3. Human Oversight and Accountability

AI should augment - not replace - human judgment. Clear accountability structures ensure that organizations remain responsible for the outcomes of their systems.

4. Ethical and Regulatory Frameworks

Governments and industry bodies are increasingly developing guidelines to ensure fairness, transparency, and accountability. These frameworks help align AI development with societal values.

Conclusion

Algorithmic bias is not an inevitable flaw of AI - it’s a challenge we can address with thoughtful design, responsible data practices, and ongoing oversight. As AI continues to shape our world, the choices we make today will determine whether these systems reinforce old inequities or help build a more just and inclusive future.

Disclaimer: The whole text was generated by Copilot (under Windows 10) at the first attempt. This is just an experiment to evaluate feature's ability to answer standard general questions, independently on whether they are correctly or incorrectly posed. Moreover, the answers may reflect hallucinations and other types of inconsistent or incorrect reasoning.

Previous Post <<||>> Next Post

01 January 2026

🤖〽️Prompt Engineering: Copilot Unabridged (Part 123: Why AI Is Not Bulletproof in Automation Scenarios)

Prompt Engineering Series
Prompt Engineering Series

Prompt: "write a blogpost of 600 words on why AI is not bulletproof is automation scenarios"

Introduction

Artificial Intelligence (AI) has become the poster child of modern automation. From supply‑chain optimization to customer service chatbots to predictive maintenance in factories, it’s easy to assume that AI systems are precise, tireless, and - at least in theory - nearly infallible. But that assumption is exactly where organizations get into trouble. AI is powerful, yes, but bulletproof? Not even close. And understanding why it isn’t bulletproof is essential for anyone deploying automation in the next decade.

Let’s unpack the cracks beneath the shiny surface.

AI Learns From Data - And Data Is Messy

AI systems don’t understand the world; they understand patterns in data. And real‑world data is full of noise, bias, gaps, and contradictions.

  • A model trained on historical hiring data may inherit past discrimination.
  • A predictive maintenance system may fail if sensors degrade or environmental conditions shift.
  • A customer‑service bot may misinterpret a request simply because the phrasing wasn’t in its training set. 

When the data is imperfect, the automation built on top of it inherits those imperfections. AI doesn’t magically 'fix' flawed data - it amplifies it.

Automation Assumes Stability, but the Real World Is Dynamic

Traditional automation works best in stable, predictable environments. AI‑driven automation is more flexible, but it still struggles when the world changes faster than the model can adapt.

Consider:

  • Sudden market shifts
  • New regulations
  • Unexpected supply‑chain disruptions
  • Novel user behaviors
  • Rare edge‑case events

AI models trained on yesterday’s patterns can’t automatically understand tomorrow’s anomalies. Without continuous monitoring and retraining, automation becomes brittle.

AI Doesn’t 'Understand' - It Correlates

Even the most advanced AI systems don’t possess human‑level reasoning or contextual awareness. They operate on statistical correlations, not comprehension.

This leads to automation failures like:

  • Misclassifying harmless anomalies as threats
  • Failing to detect subtle but critical changes
  • Producing confident but incorrect outputs
  • Following rules literally when nuance is required

In high‑stakes environments - healthcare, finance, transportation - this lack of true understanding becomes a serious limitation.

Edge Cases Are the Achilles’ Heel

AI performs impressively on common scenarios but struggles with rare events. Unfortunately, automation systems often encounter exactly those rare events.

Examples include:

  • A self‑driving car encountering an unusual road layout
  • A fraud‑detection model missing a novel attack pattern
  • A warehouse robot misinterpreting an unexpected obstacle

Humans excel at improvisation; AI does not. Automation breaks down when reality refuses to fit the training distribution.

Security Vulnerabilities Undermine Reliability

AI systems introduce new attack surfaces:

  • Adversarial inputs can trick models with tiny, invisible perturbations.
  • Data poisoning can corrupt training sets.
  • Model inversion can leak sensitive information.
  • Prompt manipulation can cause unintended behavior in language models.
  • Automation built on AI can be manipulated in ways traditional systems never could.

Ethical and Governance Gaps Create Operational Risk

Even when AI works “correctly,” it may still cause harm if governance is weak.

Automation can:

  • Reinforce bias
  • Reduce transparency
  • Remove human oversight
  • Make decisions that lack accountability

Organizations often underestimate the social and regulatory risks of automated decision‑making. AI isn’t just a technical system - it’s a socio‑technical one.

Humans Are Still Part of the System

The biggest misconception about automation is that it removes humans. In reality, it changes the role of humans.

People must:

  • Monitor AI outputs
  • Intervene during failures
  • Interpret ambiguous results
  • Maintain and retrain models
  • Handle exceptions and edge cases

If humans aren’t properly trained or workflows aren’t redesigned, automation becomes fragile.

The Bottom Line: AI Is Powerful, but Not Invincible

AI can supercharge automation, but it’s not a magic wand. It’s a tool - one that requires careful design, continuous oversight, and a deep understanding of its limitations. Organizations that treat AI as bulletproof will face costly failures. Those that treat it as a dynamic, fallible component of a broader ecosystem will unlock its real value.

Disclaimer: The whole text was generated by Copilot (under Windows 11) at the first attempt. This is just an experiment to evaluate feature's ability to answer standard general questions, independently on whether they are correctly or incorrectly posed. Moreover, the answers may reflect hallucinations and other types of inconsistent or incorrect reasoning.

Previous Post <<||>> Next Post

30 October 2025

🤖〽️Prompt Engineering: Copilot Unabridged (Part 111: The Availability Heuristic in AI - When Familiarity Skews Intelligence)

 

Prompt Engineering Series
Prompt Engineering Series

Prompt: "write a post of 600 words on what is needed for creating a foundation for the further development of AI"

Introduction

In the age of Artificial Intelligence (AI), we often assume that machines are immune to human biases. But the truth is more nuanced. AI systems, while built on data and algorithms, are deeply influenced by human psychology - especially during their design, training, and interpretation. One cognitive bias that plays a significant role is the availability heuristic.

Coined by psychologists Amos Tversky and Daniel Kahneman, the availability heuristic is a mental shortcut where people estimate the probability of an event based on how easily examples come to mind. For instance, if you recently heard about a plane crash, you might overestimate the risk of flying - even though statistically, it's safer than driving. This bias helps us make quick decisions, but it often leads to errors in judgment.

How It Shows Up in AI Systems

AI models are trained on data - lots of it. But the availability of certain data types can skew the model’s understanding of reality. If a dataset contains more examples of one type of event (say, fraudulent transactions from a specific region), the AI may overestimate the likelihood of fraud in that region, even if the real-world distribution is different. This is a direct reflection of the availability heuristic: the model 'sees' more of something and assumes it’s more common.

Moreover, developers and data scientists are not immune to this bias. When selecting training data or designing algorithms, they may rely on datasets that are readily available or familiar, rather than those that are representative. This can lead to biased outcomes, especially in sensitive domains like healthcare, hiring, or criminal justice. 

Human Interpretation of AI Outputs

The availability heuristic doesn’t just affect AI systems - it also affects how humans interpret them. When users interact with AI tools like ChatGPT or recommendation engines, they often accept the first answer or suggestion without questioning its accuracy. Why? Because it’s available, and our brains are wired to trust what’s easy to access.

This is particularly dangerous in high-stakes environments. For example, a doctor using an AI diagnostic tool might favor a diagnosis that the system presents prominently, even if it’s not the most accurate. If the AI has been trained on a dataset where a certain condition appears frequently, it might over-represent that condition in its suggestions. The human, influenced by availability bias, might accept it without deeper scrutiny.

The Role of Information Overload

In today’s digital world, we’re bombarded with information. AI systems help us filter and prioritize, but they also reinforce the availability heuristic. Search engines, social media algorithms, and news aggregators show us what’s popular or trending - not necessarily what’s accurate. As a result, we form opinions and make decisions based on what we see most often, not what’s most valid.

This creates echo chambers and reinforces stereotypes. For instance, if an AI-powered news feed frequently shows stories about crime in urban areas, users may develop a skewed perception of urban safety - even if crime rates are declining.

Mitigating the Bias

To combat the availability heuristic in AI, both developers and users must be proactive:

  • Diversify training data to ensure models reflect reality, not just what’s easy to collect.
  • Design transparent systems that explain how decisions are made.
  • Educate users about cognitive biases and encourage critical thinking.
  • Audit AI outputs regularly to identify patterns of overrepresentation or omission.
Conclusion

The availability heuristic is a powerful psychological bias that influences both the design and interpretation of AI systems. As we rely more on AI to guide decisions, understanding and mitigating this bias becomes essential - not just for accuracy, but for fairness and trust.

Just try the prompt on Copilot or your favorite AI-powered assistant! Have you got a different/similar result? How big or important is the difference? Any other thoughts?
Just share the link to the post with me and I'll add it to this post as a resource!

Disclaimer: The whole text was generated by Copilot (under Windows 11) at the first attempt. This is just an experiment to evaluate feature's ability to answer standard general questions, independently on whether they are correctly or incorrectly posed. Moreover, the answers may reflect hallucinations and other types of inconsistent or incorrect reasoning.

Previous Post <<||>> Next Post

16 April 2025

🧮ERP: Implementations (Part XIII: On Project Management)

ERP Implementations Series
ERP Implementations Series

Given its intrinsic complexity and extended implications, an ERP implementation can be considered as the real test of endurance for a Project Manager, respectively the team managed. Such projects typically deal with multiple internal and external parties with various interests in the outcomes of the project. Moreover, such projects involve multiple technologies, systems, and even methodologies. But, more importantly, such projects tend to have specific characteristics associated with their mass, being challenging to manage within the predefined constraints: time, scope, costs and quality.

From a Project Manager’s perspective what counts is only the current project. From a PMO perspective, one project, independent of its type, must be put within the broader perspective, while looking at the synergies and other important aspects that can help the organization. Unfortunately, for many organizations all begins and ends with the implementation, and this independently of the outcomes of the project. Often failure lurks in the background and usually there can be small differences that in the long term have a considerable impact. ERP implementations are more than other projects sensitive on the initial conditions – the premises under which the project starts and progresses. 

One way of coping with this inherent complexity is to split projects into several phases considered as projects or subprojects in their own boundaries. This allows organizations to narrow the focus and split the overall work into more manageable pieces, reducing to some degree the risks while learning in the process about organization’s capabilities in addressing the various aspects. Conversely, the phases are not necessarily sequential but often must overlap to better manage the resources and minimize waste. 

Given that an implementation project can take years, it’s normal for people to come and go, some taking over work from colleagues, with or without knowledge transfer. The knowledge is available further on, as long as the resources don’t leave the organization, though knowledge transfer can’t be taken for granted. It’s also normal for resources to suddenly not be available or disappear, increasing the burden that needs to be shifted on others’ shoulders. There’s seldom a project without such events and one needs to make the best of each situation, even if several tries and iterations are needed in the process.

Somebody needs to manage all this, and the weight of the whole project falls on a PM’s shoulders. Managing by exception and other management principles break under the weight of implementation projects and often it’s challenging to make progress without addressing this. Fortunately, PMs can shift the burden on Key Users and other parties involved in the project. Splitting a project in subprojects can help set boundaries even if more management could occasionally be involved. Also having clear responsibilities and resources who can take over the burdens when needed can be a sign of maturity of the teams, respectively the organization. 

Teams in Project Management are often compared with teams in sports, though the metaphor is partially right when each party has a ball to play with, while some of the players or even teams prefer to play alone at their own pace. It takes time to build effective teams that play well together, and the team spirit or other similar concepts can't fill all the gaps existing in organizations! Training in team sports has certain characteristics that must be mirrored in organizations to allow for teams to improve. Various parties expect from the PM to be the binder and troubleshooter of something that should have been part of an organization’s DNA! Bringing external players to do the heavy lifting may sometimes work, though who’ll do the lifting after the respective resources are gone? 

Previous Post <<||>> Next Post

29 March 2021

Notes: Team Data Science Process (TDSP)

Team Data Science Process (TDSP)
Acronyms:
Artificial Intelligence (AI)
Cross-Industry Standard Process for Data Mining (CRISP-DM)
Data Mining (DM)
Knowledge Discovery in Databases (KDD)
Team Data Science Process (TDSP) 
Version Control System (VCS)
Visual Studio Team Services (VSTS)

Resources:
[1] Microsoft Azure (2020) What is the Team Data Science Process? [source]
[2] Microsoft Azure (2020) The business understanding stage of the Team Data Science Process lifecycle [source]
[3] Microsoft Azure (2020) Data acquisition and understanding stage of the Team Data Science Process [source]
[4] Microsoft Azure (2020) Modeling stage of the Team Data Science Process lifecycle [source
[5] Microsoft Azure (2020) Deployment stage of the Team Data Science Process lifecycle [source]
[6] Microsoft Azure (2020) Customer acceptance stage of the Team Data Science Process lifecycle [source]

12 December 2018

🔭Data Science: Neural Networks (Just the Quotes)

"The terms 'black box' and 'white box' are convenient and figurative expressions of not very well determined usage. I shall understand by a black box a piece of apparatus, such as four-terminal networks with two input and two output terminals, which performs a definite operation on the present and past of the input potential, but for which we do not necessarily have any information of the structure by which this operation is performed. On the other hand, a white box will be similar network in which we have built in the relation between input and output potentials in accordance with a definite structural plan for securing a previously determined input-output relation." (Norbert Wiener, "Cybernetics: Or Control and Communication in the Animal and the Machine", 1948)

"A neural network is a massively parallel distributed processor that has a natural propensity for storing experiential knowledge and making it available for use. It resembles the brain in two respects: 1. Knowledge is acquired by the network through a learning process. 2. Interneuron connection strengths known as synaptic weights are used to store the knowledge." (Igor Aleksander, "An introduction to neural computing", 1990) 

"Neural Computing is the study of networks of adaptable nodes which through a process of learning from task examples, store experiential knowledge and make it available for use." (Igor Aleksander, "An introduction to neural computing", 1990)

"A neural network is characterized by (1) its pattern of connections between the neurons (called its architecture), (2) its method of determining the weights on the connections (called its training, or learning, algorithm), and (3) its activation function." (Laurene Fausett, "Fundamentals of Neural Networks", 1994)

"An artificial neural network is an information-processing system that has certain performance characteristics in common with biological neural networks. Artificial neural networks have been developed as generalizations of mathematical models of human cognition or neural biology, based on the assumptions that: (1) Information processing occurs at many simple elements called neurons. (2) Signals are passed between neurons over connection links. (3) Each connection link has an associated weight, which, in a typical neural net, multiplies the signal transmitted. (4) Each neuron applies an activation function (usually nonlinear) to its net input (sum of weighted input signals) to determine its output signal." (Laurene Fausett, "Fundamentals of Neural Networks", 1994)

"An artificial neural network (or simply a neural network) is a biologically inspired computational model that consists of processing elements (neurons) and connections between them, as well as of training and recall algorithms." (Nikola K Kasabov, "Foundations of Neural Networks, Fuzzy Systems, and Knowledge Engineering", 1996)

"Many of the basic functions performed by neural networks are mirrored by human abilities. These include making distinctions between items (classification), dividing similar things into groups (clustering), associating two or more things (associative memory), learning to predict outcomes based on examples (modeling), being able to predict into the future (time-series forecasting), and finally juggling multiple goals and coming up with a good- enough solution (constraint satisfaction)." (Joseph P Bigus,"Data Mining with Neural Networks: Solving business problems from application development to decision support", 1996)

"More than just a new computing architecture, neural networks offer a completely different paradigm for solving problems with computers. […] The process of learning in neural networks is to use feedback to adjust internal connections, which in turn affect the output or answer produced. The neural processing element combines all of the inputs to it and produces an output, which is essentially a measure of the match between the input pattern and its connection weights. When hundreds of these neural processors are combined, we have the ability to solve difficult problems such as credit scoring." (Joseph P Bigus,"Data Mining with Neural Networks: Solving business problems from application development to decision support", 1996)

"Neural networks are a computing model grounded on the ability to recognize patterns in data. As a consequence, they have many applications to data mining and analysis." (Joseph P Bigus,"Data Mining with Neural Networks: Solving business problems from application development to decision support", 1996)

"Neural networks are a computing technology whose fundamental purpose is to recognize patterns in data. Based on a computing model similar to the underlying structure of the human brain, neural networks share the brains ability to learn or adapt in response to external inputs. When exposed to a stream of training data, neural networks can discover previously unknown relationships and learn complex nonlinear mappings in the data. Neural networks provide some fundamental, new capabilities for processing business data. However, tapping these new neural network data mining functions requires a completely different application development process from traditional programming." (Joseph P Bigus, "Data Mining with Neural Networks: Solving business problems from application development to decision support", 1996)

"The most familiar example of swarm intelligence is the human brain. Memory, perception and thought all arise out of the nett actions of billions of individual neurons. As we saw earlier, artificial neural networks (ANNs) try to mimic this idea. Signals from the outside world enter via an input layer of neurons. These pass the signal through a series of hidden layers, until the result emerges from an output layer. Each neuron modifies the signal in some simple way. It might, for instance, convert the inputs by plugging them into a polynomial, or some other simple function. Also, the network can learn by modifying the strength of the connections between neurons in different layers." (David G Green, "The Serendipity Machine: A voyage of discovery through the unexpected world of computers", 2004)

"A neural network is a particular kind of computer program, originally developed to try to mimic the way the human brain works. It is essentially a computer simulation of a complex circuit through which electric current flows." (Keith J Devlin & Gary Lorden, "The Numbers behind NUMB3RS: Solving crime with mathematics", 2007)

 "Neural networks are a popular model for learning, in part because of their basic similarity to neural assemblies in the human brain. They capture many useful effects, such as learning from complex data, robustness to noise or damage, and variations in the data set. " (Peter C R Lane, Order Out of Chaos: Order in Neural Networks, 2007)

"A network of many simple processors ('units' or 'neurons') that imitates a biological neural network. The units are connected by unidirectional communication channels, which carry numeric data. Neural networks can be trained to find nonlinear relationships in data, and are used in various applications such as robotics, speech recognition, signal processing, medical diagnosis, or power systems." (Adnan Khashman et al, "Voltage Instability Detection Using Neural Networks", 2009)

"An artificial neural network, often just called a 'neural network' (NN), is an interconnected group of artificial neurons that uses a mathematical model or computational model for information processing based on a connectionist approach to computation. Knowledge is acquired by the network from its environment through a learning process, and interneuron connection strengths (synaptic weighs) are used to store the acquired knowledge." (Larbi Esmahi et al, "Adaptive Neuro-Fuzzy Systems", 2009)

"Generally, these programs fall within the techniques of reinforcement learning and the majority use an algorithm of temporal difference learning. In essence, this computer learning paradigm approximates the future state of the system as a function of the present state. To reach that future state, it uses a neural network that changes the weight of its parameters as it learns." (Diego Rasskin-Gutman, "Chess Metaphors: Artificial Intelligence and the Human Mind", 2009)

"The simplest basic architecture of an artificial neural network is composed of three layers of neurons - input, output, and intermediary (historically called perceptron). When the input layer is stimulated, each node responds in a particular way by sending information to the intermediary level nodes, which in turn distribute it to the output layer nodes and thereby generate a response. The key to artificial neural networks is in the ways that the nodes are connected and how each node reacts to the stimuli coming from the nodes it is connected to. Just as with the architecture of the brain, the nodes allow information to pass only if a specific stimulus threshold is passed. This threshold is governed by a mathematical equation that can take different forms. The response depends on the sum of the stimuli coming from the input node connections and is 'all or nothing'." (Diego Rasskin-Gutman, "Chess Metaphors: Artificial Intelligence and the Human Mind", 2009)

"Neural networks can model very complex patterns and decision boundaries in the data and, as such, are very powerful. In fact, they are so powerful that they can even model the noise in the training data, which is something that definitely should be avoided. One way to avoid this overfitting is by using a validation set in a similar way as with decision trees.[...] Another scheme to prevent a neural network from overfitting is weight regularization, whereby the idea is to keep the weights small in absolute sense because otherwise they may be fitting the noise in the data. This is then implemented by adding a weight size term (e.g., Euclidean norm) to the objective function of the neural network." (Bart Baesens, "Analytics in a Big Data World: The Essential Guide to Data Science and Its Applications", 2014)

"A neural network consists of a set of neurons that are connected together. A neuron takes a set of numeric values as input and maps them to a single output value. At its core, a neuron is simply a multi-input linear-regression function. The only significant difference between the two is that in a neuron the output of the multi-input linear-regression function is passed through another function that is called the activation function." (John D Kelleher & Brendan Tierney, "Data Science", 2018)

"Just as they did thirty years ago, machine learning programs (including those with deep neural networks) operate almost entirely in an associational mode. They are driven by a stream of observations to which they attempt to fit a function, in much the same way that a statistician tries to fit a line to a collection of points. Deep neural networks have added many more layers to the complexity of the fitted function, but raw data still drives the fitting process. They continue to improve in accuracy as more data are fitted, but they do not benefit from the 'super-evolutionary speedup'."  (Judea Pearl & Dana Mackenzie, "The Book of Why: The new science of cause and effect", 2018)

"A neural-network algorithm is simply a statistical procedure for classifying inputs (such as numbers, words, pixels, or sound waves) so that these data can mapped into outputs. The process of training a neural-network model is advertised as machine learning, suggesting that neural networks function like the human mind, but neural networks estimate coefficients like other data-mining algorithms, by finding the values for which the model’s predictions are closest to the observed values, with no consideration of what is being modeled or whether the coefficients are sensible." (Gary Smith & Jay Cordes, "The 9 Pitfalls of Data Science", 2019)

"Deep neural networks have an input layer and an output layer. In between, are “hidden layers” that process the input data by adjusting various weights in order to make the output correspond closely to what is being predicted. [...] The mysterious part is not the fancy words, but that no one truly understands how the pattern recognition inside those hidden layers works. That’s why they’re called 'hidden'. They are an inscrutable black box - which is okay if you believe that computers are smarter than humans, but troubling otherwise." (Gary Smith & Jay Cordes, "The 9 Pitfalls of Data Science", 2019)

"Neural-network algorithms do not know what they are manipulating, do not understand their results, and have no way of knowing whether the patterns they uncover are meaningful or coincidental. Nor do the programmers who write the code know exactly how they work and whether the results should be trusted. Deep neural networks are also fragile, meaning that they are sensitive to small changes and can be fooled easily." (Gary Smith & Jay Cordes, "The 9 Pitfalls of Data Science", 2019)

"The label neural networks suggests that these algorithms replicate the neural networks in human brains that connect electrically excitable cells called neurons. They don’t. We have barely scratched the surface in trying to figure out how neurons receive, store, and process information, so we cannot conceivably mimic them with computers." (Gary Smith & Jay Cordes, "The 9 Pitfalls of Data Science", 2019)

More quotes on "Neural Networks" at the-web-of-knowledge.blogspot.com.

13 November 2018

🔭Data Science: Training (Just the Quotes)

"A neural network is characterized by (1) its pattern of connections between the neurons (called its architecture), (2) its method of determining the weights on the connections (called its training, or learning, algorithm), and (3) its activation function." (Laurene Fausett, "Fundamentals of Neural Networks", 1994)

"An artificial neural network (or simply a neural network) is a biologically inspired computational model that consists of processing elements (neurons) and connections between them, as well as of training and recall algorithms." (Nikola K Kasabov, "Foundations of Neural Networks, Fuzzy Systems, and Knowledge Engineering", 1996)

"[…] an obvious difference between our best classifiers and human learning is the number of examples required in tasks such as object detection. […] the difficulty of a learning task depends on the size of the required hypothesis space. This complexity determines in turn how many training examples are needed to achieve a given level of generalization error. Thus the complexity of the hypothesis space sets the speed limit and the sample complexity for learning." (Tomaso Poggio & Steve Smale, "The Mathematics of Learning: Dealing with Data", Notices of the AMS, 2003)

"Learning a complicated function that matches the training data closely but fails to recognize the underlying process that generates the data. As a result of overfitting, the model performs poor on new input. Overfitting occurs when the training patterns are sparse in input space and/or the trained networks are too complex." (Frank Padberg, "Counting the Hidden Defects in Software Documents", 2010)

"Decision trees are also considered nonparametric models. The reason for this is that when we train a decision tree from data, we do not assume a fixed set of parameters prior to training that define the tree. Instead, the tree branching and the depth of the tree are related to the complexity of the dataset it is trained on. If new instances were added to the dataset and we rebuilt the tree, it is likely that we would end up with a (potentially very) different tree." (John D Kelleher et al, "Fundamentals of Machine Learning for Predictive Data Analytics: Algorithms, Worked Examples, and Case Studies", 2015)

"Boosting defines an objective function to measure the performance of a model given a certain set of parameters. The objective function contains two parts: regularization and training loss, both of which add to one another. The training loss measures how predictive our model is on the training data. The most commonly used training loss function includes mean squared error and logistic regression. The regularization term controls the complexity of the model, which helps avoid overfitting." (Danish Haroon, "Python Machine Learning Case Studies", 2017)

"Decision trees are important for a few reasons. First, they can both classify and regress. It requires literally one line of code to switch between the two models just described, from a classification to a regression. Second, they are able to determine and share the feature importance of a given training set." (Russell Jurney, "Agile Data Science 2.0: Building Full-Stack Data Analytics Applications with Spark", 2017)

"Early stopping and regularization can ensure network generalization when you apply them properly. [...] With early stopping, the choice of the validation set is also important. The validation set should be representative of all points in the training set. When you use Bayesian regularization, it is important to train the network until it reaches convergence. The sum-squared error, the sum-squared weights, and the effective number of parameters should reach constant values when the network has converged. With both early stopping and regularization, it is a good idea to train the network starting from several different initial conditions. It is possible for either method to fail in certain circumstances. By testing several different initial conditions, you can verify robust network performance." (Mark H Beale et al, "Neural Network Toolbox™ User's Guide", 2017)

"Variance is a prediction error due to different sets of training samples. Ideally, the error should not vary from one training sample to another sample, and the model should be stable enough to handle hidden variations between input and output variables. Normally this occurs with the overfitted model." (Umesh R Hodeghatta & Umesha Nayak, "Business Analytics Using R: A Practical Approach", 2017)

"One of the most common problems that you will encounter when training deep neural networks will be overfitting. What can happen is that your network may, owing to its flexibility, learn patterns that are due to noise, errors, or simply wrong data. [...] The essence of overfitting is to have unknowingly extracted some of the residual variation (i.e., the noise) as if that variation represented the underlying model structure. The opposite is called underfitting - when the model cannot capture the structure of the data." (Umberto Michelucci, "Applied Deep Learning: A Case-Based Approach to Understanding Deep Neural Networks", 2018)

"The premise of classification is simple: given a categorical target variable, learn patterns that exist between instances composed of independent variables and their relationship to the target. Because the target is given ahead of time, classification is said to be supervised machine learning because a model can be trained to minimize error between predicted and actual categories in the training data. Once a classification model is fit, it assigns categorical labels to new instances based on the patterns detected during training." (Benjamin Bengfort et al, "Applied Text Analysis with Python: Enabling Language-Aware Data Products with Machine Learning", 2018)

"The trick is to walk the line between underfitting and overfitting. An underfit model has low variance, generally making the same predictions every time, but with extremely high bias, because the model deviates from the correct answer by a significant amount. Underfitting is symptomatic of not having enough data points, or not training a complex enough model. An overfit model, on the other hand, has memorized the training data and is completely accurate on data it has seen before, but varies widely on unseen data. Neither an overfit nor underfit model is generalizable - that is, able to make meaningful predictions on unseen data." (Benjamin Bengfort et al, "Applied Text Analysis with Python: Enabling Language-Aware Data Products with Machine Learning", 2018)

"There is a trade-off between bias and variance [...]. Complexity increases with the number of features, parameters, depth, training epochs, etc. As complexity increases and the model overfits, the error on the training data decreases, but the error on test data increases, meaning that the model is less generalizable." (Benjamin Bengfort et al, "Applied Text Analysis with Python: Enabling Language-Aware Data Products with Machine Learning", 2018)

"Cross-validation is a useful tool for finding optimal predictive models, and it also works well in visualization. The concept is simple: split the data at random into a 'training' and a 'test' set, fit the model to the training data, then see how well it predicts the test data. As the model gets more complex, it will always fit the training data better and better. It will also start off getting better results on the test data, but there comes a point where the test data predictions start going wrong." (Robert Grant, "Data Visualization: Charts, Maps and Interactive Graphics", 2019)

"Any machine learning model is trained based on certain assumptions. In general, these assumptions are the simplistic approximations of some real-world phenomena. These assumptions simplify the actual relationships between features and their characteristics and make a model easier to train. More assumptions means more bias. So, while training a model, more simplistic assumptions = high bias, and realistic assumptions that are more representative of actual phenomena = low bias." (Imran Ahmad, "40 Algorithms Every Programmer Should Know", 2020)

"In machine learning, 'training' is when we teach models to understand language and code by analyzing massive amounts of data. During training, the model learns statistical patterns - how often certain words appear together, what code structures are common, andhow different parts of text relate to each other. The quality of training data directly affects how well the model performs." (Jeremy C Morgan, "Coding with AI: Examples in Python", 2025)

🔭Data Science: Training Data (Just the Quotes)

"Neural networks are a computing technology whose fundamental purpose is to recognize patterns in data. Based on a computing model similar to the underlying structure of the human brain, neural networks share the brains ability to learn or adapt in response to external inputs. When exposed to a stream of training data, neural networks can discover previously unknown relationships and learn complex nonlinear mappings in the data. Neural networks provide some fundamental, new capabilities for processing business data. However, tapping these new neural network data mining functions requires a completely different application development process from traditional programming." (Joseph P Bigus, "Data Mining with Neural Networks: Solving business problems from application development to decision support", 1996)

"[…] an obvious difference between our best classifiers and human learning is the number of examples required in tasks such as object detection. […] the difficulty of a learning task depends on the size of the required hypothesis space. This complexity determines in turn how many training examples are needed to achieve a given level of generalization error. Thus the complexity of the hypothesis space sets the speed limit and the sample complexity for learning." (Tomaso Poggio & Steve Smale, "The Mathematics of Learning: Dealing with Data", Notices of the AMS, 2003)

"Learning a complicated function that matches the training data closely but fails to recognize the underlying process that generates the data. As a result of overfitting, the model performs poor on new input. Overfitting occurs when the training patterns are sparse in input space and/or the trained networks are too complex." (Frank Padberg, "Counting the Hidden Defects in Software Documents", 2010)

"Overfitting occurs when a formula describes a set of data very closely, but does not lead to any sensible explanation for the behavior of the data and does not predict the behavior of comparable data sets. In the case of overfitting, the formula is said to describe the noise of the system rather than the characteristic behavior of the system. Overfitting occurs frequently with models that perform iterative approximations on training data, coming closer and closer to the training data set with each iteration. Neural networks are an example of a data modeling strategy that is prone to overfitting." (Jules H Berman, "Principles of Big Data: Preparing, Sharing, and Analyzing Complex Information", 2013)

"Briefly speaking, to solve a Machine Learning problem means you optimize a model to fit all the data from your training set, and then you use the model to predict the results you want. Therefore, evaluating a model need to see how well it can be used to predict the data out of the training set. Usually there are three types of the models: underfitting, fair and overfitting model [...]. If we want to predict a value, both (a) and (c) in this figure cannot work well. The underfitting model does not capture the structure of the problem at all, and we say it has high bias. The overfitting model tries to fit every sample in the training set and it did it, but we say it is of high variance. In other words, it fails to generalize new data." (Shudong Hao, "A Beginner’s Tutorial for Machine Learning Beginners", 2014)

"Neural networks can model very complex patterns and decision boundaries in the data and, as such, are very powerful. In fact, they are so powerful that they can even model the noise in the training data, which is something that definitely should be avoided. One way to avoid this overfitting is by using a validation set in a similar way as with decision trees.[...] Another scheme to prevent a neural network from overfitting is weight regularization, whereby the idea is to keep the weights small in absolute sense because otherwise they may be fitting the noise in the data. This is then implemented by adding a weight size term (e.g., Euclidean norm) to the objective function of the neural network." (Bart Baesens, "Analytics in a Big Data World: The Essential Guide to Data Science and Its Applications", 2014)

"A predictive model overfits the training set when at least some of the predictions it returns are based on spurious patterns present in the training data used to induce the model. Overfitting happens for a number of reasons, including sampling variance and noise in the training set. The problem of overfitting can affect any machine learning algorithm; however, the fact that decision tree induction algorithms work by recursively splitting the training data means that they have a natural tendency to segregate noisy instances and to create leaf nodes around these instances. Consequently, decision trees overfit by splitting the data on irrelevant features that only appear relevant due to noise or sampling variance in the training data. The likelihood of overfitting occurring increases as a tree gets deeper because the resulting predictions are based on smaller and smaller subsets as the dataset is partitioned after each feature test in the path." (John D Kelleher et al, "Fundamentals of Machine Learning for Predictive Data Analytics: Algorithms, Worked Examples, and Case Studies", 2015)

"Cross-validation is a method of splitting all of your data into two parts: training and validation. The training data is used to build the machine learning model, whereas the validation data is used to validate that the model is doing what is expected. This increases our ability to find and determine the underlying errors in a model." (Matthew Kirk, "Thoughtful Machine Learning", 2015)

"Tree pruning identifies and removes subtrees within a decision tree that are likely to be due to noise and sample variance in the training set used to induce it. In cases where a subtree is deemed to be overfitting, pruning the subtree means replacing the subtree with a leaf node that makes a prediction based on the majority target feature level (or average target feature value) of the dataset created by merging the instances from all the leaf nodes in the subtree. Obviously, pruning will result in decision trees being created that are not consistent with the training set used to build them. In general, however, we are more interested in creating prediction models that generalize well to new data rather than that are strictly consistent with training data, so it is common to sacrifice consistency for generalization capacity." (John D Kelleher et al, "Fundamentals of Machine Learning for Predictive Data Analytics: Algorithms, Worked Examples, and Case Studies", 2015)

"When memorization happens, you may have the illusion that everything is working well because your machine learning algorithm seems to have fitted the in sample data so well. Instead, problems can quickly become evident when you start having it work with out-of-sample data and you notice that it produces errors in its predictions as well as errors that actually change a lot when you relearn from the same data with a slightly different approach. Overfitting occurs when your algorithm has learned too much from your data, up to the point of mapping curve shapes and rules that do not exist [...]. Any slight change in the procedure or in the training data produces erratic predictions." (John P Mueller & Luca Massaron, "Machine Learning for Dummies", 2016)

"Bias is error from incorrect assumptions built into the model, such as restricting an interpolating function to be linear instead of a higher-order curve. [...] Errors of bias produce underfit models. They do not fit the training data as tightly as possible, were they allowed the freedom to do so. In popular discourse, I associate the word 'bias' with prejudice, and the correspondence is fairly apt: an apriori assumption that one group is inferior to another will result in less accurate predictions than an unbiased one. Models that perform lousy on both training and testing data are underfit." (Steven S Skiena, "The Data Science Design Manual", 2017)

"Bias occurs normally when the model is underfitted and has failed to learn enough from the training data. It is the difference between the mean of the probability distribution and the actual correct value. Hence, the accuracy of the model is different for different data sets (test and training sets). To reduce the bias error, data scientists repeat the model-building process by resampling the data to obtain better prediction values." (Umesh R Hodeghatta & Umesha Nayak, "Business Analytics Using R: A Practical Approach", 2017)

"Boosting defines an objective function to measure the performance of a model given a certain set of parameters. The objective function contains two parts: regularization and training loss, both of which add to one another. The training loss measures how predictive our model is on the training data. The most commonly used training loss function includes mean squared error and logistic regression. The regularization term controls the complexity of the model, which helps avoid overfitting." (Danish Haroon, "Python Machine Learning Case Studies", 2017)

"From a typical training set, many alternative decision trees can be created. As a rule, smaller trees are to be preferred, their main advantages being interpretability, removal of irrelevant and redundant attributes, and lower danger of overfitting noisy training data." (Miroslav Kubat, "An Introduction to Machine Learning" 2nd Ed., 2017)

"High-bias models typically produce simpler models that do not overfit and in those cases the danger is that of underfitting. Models with low-bias are typically more complex and that complexity enables us to represent the training data in a more accurate way. The danger here is that the flexibility provided by higher complexity may end up representing not only a relationship in the data but also the noise. Another way of portraying the bias-variance trade-off is in terms of complexity v simplicity." (Jesús Rogel-Salazar, "Data Science and Analytics with Python", 2017) 

"In machine learning, a model is defined as a function, and we describe the learning function from the training data as inductive learning. Generalization refers to how well the concepts are learned by the model by applying them to data not seen before. The goal of a good machine-learning model is to reduce generalization errors and thus make good predictions on data that the model has never seen." (Umesh R Hodeghatta & Umesha Nayak, "Business Analytics Using R: A Practical Approach", 2017)

"Multilayer perceptrons share with polynomial classifiers one unpleasant property. Theoretically speaking, they are capable of modeling any decision surface, and this makes them prone to overfitting the training data."  (Miroslav Kubat," An Introduction to Machine Learning" 2nd Ed., 2017)

"The danger of overfitting is particularly severe when the training data is not a perfect gold standard. Human class annotations are often subjective and inconsistent, leading boosting to amplify the noise at the expense of the signal. The best boosting algorithms will deal with overfitting though regularization. The goal will be to minimize the number of non-zero coefficients, and avoid large coefficients that place too much faith in any one classifier in the ensemble." (Steven S Skiena, "The Data Science Design Manual", 2017)

"Variance is a prediction error due to different sets of training samples. Ideally, the error should not vary from one training sample to another sample, and the model should be stable enough to handle hidden variations between input and output variables. Normally this occurs with the overfitted model." (Umesh R Hodeghatta & Umesha Nayak, "Business Analytics Using R: A Practical Approach", 2017)

"Variance is error from sensitivity to fluctuations in the training set. If our training set contains sampling or measurement error, this noise introduces variance into the resulting model. [...] Errors of variance result in overfit models: their quest for accuracy causes them to mistake noise for signal, and they adjust so well to the training data that noise leads them astray. Models that do much better on testing data than training data are overfit." (Steven S Skiena, "The Data Science Design Manual", 2017)

"The premise of classification is simple: given a categorical target variable, learn patterns that exist between instances composed of independent variables and their relationship to the target. Because the target is given ahead of time, classification is said to be supervised machine learning because a model can be trained to minimize error between predicted and actual categories in the training data. Once a classification model is fit, it assigns categorical labels to new instances based on the patterns detected during training." (Benjamin Bengfort et al, "Applied Text Analysis with Python: Enabling Language-Aware Data Products with Machine Learning", 2018)

"The trick is to walk the line between underfitting and overfitting. An underfit model has low variance, generally making the same predictions every time, but with extremely high bias, because the model deviates from the correct answer by a significant amount. Underfitting is symptomatic of not having enough data points, or not training a complex enough model. An overfit model, on the other hand, has memorized the training data and is completely accurate on data it has seen before, but varies widely on unseen data. Neither an overfit nor underfit model is generalizable - that is, able to make meaningful predictions on unseen data." (Benjamin Bengfort et al, "Applied Text Analysis with Python: Enabling Language-Aware Data Products with Machine Learning", 2018)

"There is a trade-off between bias and variance [...]. Complexity increases with the number of features, parameters, depth, training epochs, etc. As complexity increases and the model overfits, the error on the training data decreases, but the error on test data increases, meaning that the model is less generalizable." (Benjamin Bengfort et al, "Applied Text Analysis with Python: Enabling Language-Aware Data Products with Machine Learning", 2018)

"Cross-validation is a useful tool for finding optimal predictive models, and it also works well in visualization. The concept is simple: split the data at random into a 'training' and a 'test' set, fit the model to the training data, then see how well it predicts the test data. As the model gets more complex, it will always fit the training data better and better. It will also start off getting better results on the test data, but there comes a point where the test data predictions start going wrong." (Robert Grant, "Data Visualization: Charts, Maps and Interactive Graphics", 2019)

"The classifier accuracy would be extra ordinary when the test data and the training data are overlapping. But when the model is applied to a new data it will fail to show acceptable accuracy. This condition is called as overfitting." (Jesu V  Nayahi J & Gokulakrishnan K, "Medical Image Classification", 2019)

Related Posts Plugin for WordPress, Blogger...

About Me

My photo
Koeln, NRW, Germany
IT Professional with more than 25 years experience in IT in the area of full life-cycle of Web/Desktop/Database Applications Development, Software Engineering, Consultancy, Data Management, Data Quality, Data Migrations, Reporting, ERP implementations & support, Team/Project/IT Management, etc.