Library & Data - amuzetnoM/artifactvirtual GitHub Wiki
This directory contains carefully curated datasets that serve as knowledge foundations for the Artifact Virtual ecosystem. These datasets provide structured facts, reasoning patterns, language primitives, and temporal relationships that support reproducible research and AI development.
Overview
The datasets have been selected for their foundational value in understanding intelligence and supporting reproducible research. The directory is organized into:
- Core foundational datasets - Essential factual knowledge across domains
- Library of Immutable Knowledge - Indexed directory of knowledge sources
- Custom annotations - Project-specific insights and findings
Dataset Contents
core_facts.json
Origin: Curated from encyclopedic sources and foundational scientific literature.
Content: Universal facts across mathematics, physics, biology, and logic.
Purpose: Serves as the backbone for reasoning and inference tasks.
reasoning_patterns.json
Origin: Synthesized from cognitive science and AI research papers.
Content: Common reasoning templates and logical deduction patterns.
Purpose: Enables systems to generalize and apply structured reasoning.
language_primitives.json
Origin: Extracted from linguistic corpora and language model benchmarks.
Content: Essential language constructs, grammar rules, and semantic primitives.
Purpose: Supports robust natural language understanding and generation.
temporal_events.json
Origin: Aggregated from historical datasets and time-series research.
Content: Key events, timelines, and temporal relationships.
Purpose: Facilitates temporal reasoning and event-based diagnostics.
custom_annotations.json
Origin: Manually annotated during project development and research.
Content: Project-specific insights, edge cases, and experimental findings.
Purpose: Captures evolving knowledge and supports continuous improvement.
Library of Immutable Knowledge
The Library of Immutable Knowledge is a comprehensive, indexed source of foundational and advanced knowledge across all domains. It provides carefully curated references to:
- Fundamental Physics & Cosmology
- Core Mathematics & Logic
- Biological Systems & Consciousness
- Comprehensive Knowledge Repositories
- Philosophy, Computation & Limits of Knowledge
- Knowledge Aggregators & Educational Platforms
- Integrative & Specialized Resources
Usage Guidelines
Loading Datasets
import json
# Load core facts
with open('datasets/core_facts.json', 'r', encoding='utf-8') as f:
core_facts = json.load(f)
# Access knowledge by category
math_facts = core_facts.get('mathematics', [])
physics_facts = core_facts.get('physics', [])
Contributing to Datasets
When adding new information to the datasets:
- Verify information from multiple reliable sources
- Include source citations for all new entries
- Maintain the existing JSON schema structure
- Add appropriate metadata (timestamp, confidence, etc.)
Integration with Artifact Virtual
These datasets are used throughout the Artifact Virtual ecosystem:
- They provide grounding knowledge for LLMs via the Model Context Protocol
- They support the development of reasoning capabilities in the Oracle CLI
- They enable verification of the TemporalCalendar examples
- They serve as calibration data for AutoRound model quantization
Versioning and Maintenance
All datasets are versioned and documented for absolute clarity, reproducibility, and educational value. When updating datasets:
- Increment the version number in the metadata
- Document changes in the changelog
- Note any implications for dependent systems
Library of (somewhat) Immutable Knowledge
=============================================
Welcome to the comprehensive, indexed source of foundational and advanced knowledge across all domains. Use the table of contents for quick navigation.
Table of Contents
- Fundamental Physics & Cosmology
- Core Mathematics & Logic
- Biological Systems & Consciousness
- Comprehensive Knowledge Repositories
- Philosophy, Computation & Limits of Knowledge
- Knowledge Aggregators & Educational Platforms
- Integrative & Specialized Resources
1. Fundamental Physics & Cosmology
-
The Standard Model of Particle Physics
-
General Relativity & Quantum Field Theory
-
Thermodynamics & Information Theory
2. Core Mathematics & Logic
-
Mathematical Structures
-
Logic & Foundations
3. Biological Systems & Consciousness
-
Genetics & Genomics
-
Neuroscience & Consciousness
-
Evolution & Systems Biology
4. Comprehensive Knowledge Repositories
-
General Reference
-
Academic & Scientific Literature
-
Computational & Data Resources
-
Books & Texts
5. Philosophy, Computation & Limits of Knowledge
-
Philosophy of Science & Epistemology
-
Computational Theory
-
Limits of Knowledge
6. Knowledge Aggregators & Educational Platforms
-
Online Courses & Lectures
-
Video & Interactive Learning
-
Programming & Computer Science
7. Integrative & Specialized Resources
-
Philosophy & Humanities
-
Science Journalism & Magazines
-
Language & Reference
-
Open Data & Statistics
8. Questionable Sources (Needs Verification)
The following links were either automatically automatically generated or suggested and require careful verification before being considered reliable sources.
Fundamental Physics & Cosmology
- The Standard Model of Particle Physics
- General Relativity & Quantum Field Theory
- Thermodynamics & Information Theory
Core Mathematics & Logic
- Mathematical Structures
- Gödel’s Incompleteness Theorems
Biological Systems & Consciousness
- The Human Genome Project
- Neuroscience & Consciousness
Comprehensive Knowledge Repositories
- Wikipedia
- The New Yorker
- arXiv
- PubMed
- Wolfram Alpha
Philosophy, Computation & Limits of Knowledge
- Philosophy of Science & Epistemology
- Computational Theory
Knowledge Aggregators & Educational Platforms
- Khan Academy
- MIT OpenCourseWare
- Coursera
Integrative & Specialized Resources
- Stanford Encyclopedia of Philosophy
- Internet Encyclopedia of Philosophy
- Project Gutenberg
"Knowledge is not a collection of facts, but a network of understanding." This library is a living document. Contributions and suggestions are welcome!