Main Page: Difference between revisions

From Essential
Jump to navigation Jump to search
 
(71 intermediate revisions by the same user not shown)
Line 1: Line 1:
[[File:DALL·E 2024-01-06 13.58.36 - Logo for a website named 'Infocepo.com', focusing on cloud computing, AI, IT SRE salaries, and best practices in open source. The design should be mod.png|thumb|right]]
[[File:Infocepo-illustration.jpg|thumb|right]]
'''Discover cloud computing on infocepo.com''':
'''Discover cloud computing on infocepo.com''':
* Master cloud infrastructure
* Master cloud infrastructure
Line 10: Line 10:
<br>
<br>
== AI Tools ==
== AI Tools ==
*[https://chat.openai.com ChatGPT4] - public assistant with learning abilities.
* [https://chat.openai.com ChatGPT4] - Public assistant with learning abilities.
*[https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard LLM] + [https://www.scaleway.com/en/h100-pcie-try-it-now/ GPU H100] + [https://github.com/ynotopec/llm-k8s FastChat] - private assistant and OPENAI API compatible.
* [https://github.com/open-webui/open-webui open-webui] + [https://www.scaleway.com/en/h100-pcie-try-it-now/ GPU H100] + [https://ollama.com Ollama] - Private assistant and API.
*[https://github.com/lm-sys/FastChat/blob/main/docs/langchain_integration.md LANGCHAIN] - RAG.
* [https://github.com/ynotopec/summarize Private summary]
*[https://github.com/ynotopec/summarize private summary]


=== DEV ===
=== DEV ===
*[https://chat.lmsys.org LLM Evaluate]
(28/08/2024)
*[https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard LLM Ranking]
* [https://ollama.com/library LLM Trending]
*[https://huggingface.co/spaces/mteb/leaderboard Embeddings Ranking]
* [https://github.com/search?q=stars%3A%3E15000+forks%3A%3E1500+created%3A%3E2022-06-01&type=repositories&s=updated&o=desc Project Trending]
*[https://github.com/THUDM/CogVLM CogVLM] - Private API for multimodal purposes. Usable with RAG.
* [https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard LLM Ranking]
*[https://ann-benchmarks.com Vectors DB Ranking]
* [https://chat.lmsys.org ChatBot Evaluate]
*[https://github.com/chatchat-space/Langchain-Chatchat Chatchat] - private assistant with RAG capabilities but Chinese language.
* [https://www.perplexity.ai Perplexity AI] - R&D
*[https://www.nvidia.com/en-us/data-center/h200 NVIDIA H200] - KUBERNETES or HPC clusters for DATASCIENCE.
* [https://huggingface.co/models Models Trending]
*[https://www.nvidia.com/fr-fr/geforce/graphics-cards/40-series/rtx-4090 NVIDIA 4090] - GPU card for PRIVATE AI ASSISTANCE.
* [https://github.com/hiyouga/LLaMA-Factory LLM Fine Tuning]
==== INTERESTING LLMs (updated 21/12/2023) ====
* [https://huggingface.co/spaces/mteb/leaderboard Embeddings Ranking]
* Vicuna-33B (private assistant)
* [https://ann-benchmarks.com Vectors DB Ranking]
* Vicuna-13B and Vicuna-7B (16k, summary, RAG)
* [https://www.nvidia.com/en-us/data-center/h100/ NVIDIA H100] - KUBERNETES or HPC clusters for DATASCIENCE.
* cognitivecomputations/dolphin-2.2.1-mistral-7b (processing text)
* [https://www.nvidia.com/fr-fr/geforce/graphics-cards/40-series/rtx-4080-family NVIDIA 4080] - GPU card for private assistance.
* HuggingFaceH4/zephyr-7b-beta (efficient)
* [https://huggingface.co/models?pipeline_tag=image-text-to-text&sort=trending Img2txt Trending]
* FastChat-T5-3B (small devices)
* [https://huggingface.co/spaces/TIGER-Lab/GenAI-Arena Txt2img Evaluate]
* [https://github.com/chatchat-space/Langchain-Chatchat Chatchat] - Private assistant with RAG capabilities in Chinese.
* [https://top500.org/lists/green500/ HPC Efficiency]
 
==== INTERESTING LLMs ====
(23/11/2024)
{| class="wikitable"
! Model
! Comment
|-
| '''parse'''
| gemma2-simpo
|-
| '''RAG'''
| gemma2-simpo
|-
| '''RAG-FR'''
| qwen2.5
|-
| '''code'''
| gemma2-27b, $$
|-
| '''code-completion'''
| deepseek-coder:base
|-
| '''summary'''
| qwen2.5
|-
| '''ai-translate'''
| gemma2, temperature 0
|-
| '''chat-leger'''
| 0.000055 euros/token, gemma2-simpo
|-
| '''chat-lourd'''
| 0.00015 euros/token, gemma2-27b, $$
|-
| '''mannix/gemma2-9b-simpo'''
| OllamaFunctions
|}


=== NEWS ===
=== NEWS ===
*New LLMs :
(04/05/2024)
**VAGOsolutions/SauerkrautLM-SOLAR-Instruct 11B (processing ?)
* [https://www.youtube.com/@lev-selector/videos Very good AI News]
**cookinai/CatMacaroni-Slerp 7B (summary ?)
* For the [https://betterprogramming.pub/color-your-captions-streamlining-live-transcriptions-with-diart-and-openais-whisper-6203350234ef '''transcription'''] in real time with Diart, it is possible to follow the interlocutors.
**berkeley-nest/Starling-LM-7B-alpha
* [https://github.com/openai-translator/openai-translator Translation] tools like Google Translate are becoming popular.
**openchat/openchat_3.5
* [https://www.mouser.fr/ProductDetail/BittWare/RS-GQ-GC1-0109?qs=ST9lo4GX8V2eGrFMeVQmFw%3D%3D '''LLM 10x accelerator'''] and cheaper with GROQ.
*ACCEL : vision IA chip very efficient and powerful.
* [https://opensearch.org/docs/latest/search-plugins/conversational-search Opensearch with LLM]
*IBM NorthPole : an IA chip very efficient and powerful.


=== TRAINING ===
=== TRAINING ===
*[https://www.youtube.com/watch?v=4Bdc55j80l8 TRANSFORMERS ALGORITHM]
* [https://www.youtube.com/watch?v=4Bdc55j80l8 TRANSFORMERS ALGORITHM]
 
=== Cloud Native Install ===
* [https://github.com/ynotopec/gpu-cluster GPU cluster]
* [https://github.com/ynotopec/llm-k8s LLM API]
[[File:AI-API.drawio.png]]


== CLOUD LAB ==
== CLOUD LAB ==
[[file:Infocepo.drawio.png]]
[[File:Infocepo.drawio.png]]
<br><br>
<br><br>
Presenting my [[LAB project]].
Presenting my [[LAB project]].
Line 58: Line 90:
== CLOUD Migration Example ==
== CLOUD Migration Example ==
[[File:Diagram-migration-ORACLE-KVM-v2.drawio.png]]
[[File:Diagram-migration-ORACLE-KVM-v2.drawio.png]]
*1.5d: Infrastructure audit of 82 services ([https://infocepo.com/wiki/index.php/ServerDiff.sh ServerDiff.sh])
* 1.5d: Infrastructure audit of 82 services ([https://infocepo.com/wiki/index.php/ServerDiff.sh ServerDiff.sh])
* 1.5d: Create cloud architecture diagram.
* 1.5d: Compliance check of 2 clouds (6 hypervisors, 6TB memory).
* 1d: Cloud installations.
* 0.5d: Stability check.


*1.5d: Create cloud architecture diagram
*1.5d: Compliance check of 2 clouds (6 hypervisors, 6TB memory)
*1d: Cloud installations
*.5d: Stability check
{| style="border-spacing:0;width:18.12cm;"
{| style="border-spacing:0;width:18.12cm;"
|- style="background-color:#ffc000;border:0.05pt solid #000000;padding:0.049cm;"
|- style="background-color:#ffc000;border:0.05pt solid #000000;padding:0.049cm;"
Line 81: Line 110:
| style="background-color:#d8e4bc;border:0.05pt solid #000000;padding:0.049cm;color:#000000;" |  
| style="background-color:#d8e4bc;border:0.05pt solid #000000;padding:0.049cm;color:#000000;" |  
|-
|-
| style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | Power off simultaneous all nodes. Power on simultaneous all nodes.
| style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | Power off all nodes simultaneously. Power on all nodes simultaneously.
| style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | All resources are started.
| style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | All resources are started.
| style="background-color:#d8e4bc;border:0.05pt solid #000000;padding:0.049cm;color:#000000;" |  
| style="background-color:#d8e4bc;border:0.05pt solid #000000;padding:0.049cm;color:#000000;" |  
|-
|-
|}
|}
*1.5d: Cloud automation study
* 1.5d: Cloud automation study.
 
* 1.5d: Develop 6 templates (2 clouds, 2 OS, 8 environments, 2 versions).
*1.5d: Develop 6 templates (2 clouds, 2 OS, 8 environments, 2 versions)
* 1d: Create migration diagram.
 
* 1.5d: Write 138 lines of migration code ([https://infocepo.com/wiki/index.php/MigrationApp.sh MigrationApp.sh]).
*1d: Create migration diagram
* 1.5d: Process stabilization.
 
* 1.5d: Cloud vs. old infrastructure benchmark.
*1.5d: Write 138 lines of migration code ([https://infocepo.com/wiki/index.php/MigrationApp.sh MigrationApp.sh])
* 0.5d: Unavailability time calibration per migration unit.
 
* 5 min: Load 82 VMs (env, OS, application code, 2 IPs).
*1.5d: Process stabilization
 
*1.5d: Cloud vs old infrastructure benchmark
 
*.5d: Unavailability time calibration per migration unit
 
*5min: Load 82 VMs (env, os, application_code, 2 IP)


Total = 15 man-days
Total = 15 man-days.


== CLOUD Enhancement ==
== WEB Enhancement ==
[[File:WebModelDiagram.drawio.png]]
[[File:WebModelDiagram.drawio.png]]


Line 113: Line 135:
* Employ efficient frontend TLS decoder like HAPROXY.
* Employ efficient frontend TLS decoder like HAPROXY.
* Opt for fast HTTP cache like VARNISH and Apache Traffic Server for large files.
* Opt for fast HTTP cache like VARNISH and Apache Traffic Server for large files.
* Use REVERSE PROXY with TLS decoder like ENVOY for service compatibility.
* Use PROXY with TLS decoder like ENVOY for service compatibility.
* Consider serverless service for standard runtimes, mindful of potential incompatibilities.
* Consider serverless services for standard runtimes, mindful of potential incompatibilities.
* Employ load balancing or native services for dynamic computing power.
* Employ load balancing or native services for dynamic computing power.
* Use open source STACKs where possible.
* Use open-source STACKs where possible.
* Employ database caches like MEMCACHED.
* Employ database caches like MEMCACHED.
* More information at [https://wikitech.wikimedia.org/wiki/Wikimedia_infrastructure CLOUD WIKIPEDIA].
* Use queues for long batches.
* Use buffers for stability of real streams.
* More information at [https://wikitech.wikimedia.org/wiki/Wikimedia_infrastructure CLOUD WIKIPEDIA] and [https://github.com/systemdesign42/system-design GITHUB].


== CLOUD WIKIPEDIA ==
== CLOUD WIKIPEDIA ==
Line 210: Line 234:
== CLOUD providers ==
== CLOUD providers ==
* [https://cloud.google.com/free/docs/aws-azure-gcp-service-comparison CLOUD providers]
* [https://cloud.google.com/free/docs/aws-azure-gcp-service-comparison CLOUD providers]
== CLOUD INTERNET NETWORK ==
== CLOUD INTERNET NETWORK ==
* [https://global-internet-map-2021.telegeography.com/ CLOUD INTERNET NETWORK]
* [https://global-internet-map-2021.telegeography.com/ CLOUD INTERNET NETWORK]
== CLOUD NATIVE ==
== CLOUD NATIVE ==
* [https://landscape.cncf.io/?fullscreen=yes OFFICIAL STACKS]
* [https://landscape.cncf.io/?fullscreen=yes OFFICIAL STACKS]
Line 222: Line 248:
=== Typical Architecture ===
=== Typical Architecture ===


*Dual-room.
* Dual-room.
*IPMI LAN (fencing).
* IPMI LAN (fencing).
*NTP, DNS+DHCP+PXE+TFTP+HTTP (auto-provisioning), PROXY (updates or internal REPOSITORY).
* NTP, DNS+DHCP+PXE+TFTP+HTTP (auto-provisioning), PROXY (updates or internal REPOSITORY).
*Choose 2+ node clusters.
* Choose 2+ node clusters.
*For 2-node, require COROSYNC 2-node config, 10-second staggered closing for stability. But for better stability choose 3+ nodes architecture.
* For 2-node, require COROSYNC 2-node config, 10-second staggered closing for stability. For better stability, choose 3+ nodes architecture.
*Allocate 4GB/base for DB resources. CPU resource requirements generally low.
* Allocate 4GB/base for DB resources. CPU resource requirements are generally low.


=== Typical Service Pattern ===
=== Typical Service Pattern ===
*Multipath
* Multipath
*LUN
* LUN
*LVM (LVM resource)
* LVM (LVM resource)
*FS (FS resource)
* FS (FS resource)
*NFS (FS resource)
* NFS (FS resource)
*User
* User
*IP (IP resource)
* IP (IP resource)
*DNS name
* DNS name
*Process (Process resource)
* Process (Process resource)
*Listener (Listener resource)
* Listener (Listener resource)
 
== HPC ==
[[File:HPC.drawio.png]]
 
== IT Wage ==
* [http://jobsearchtech.about.com/od/educationfortechcareers/tp/HighestCerts.htm Best IT certifications]
* [https://www.silkhom.com/barometre-2021-des-tjm-dans-informatique-digital FREELANCE]
* [http://www.journaldunet.com/solutions/emploi-rh/salaire-dans-l-informatique-hays IT]


== IT salaries ==
*[http://jobsearchtech.about.com/od/educationfortechcareers/tp/HighestCerts.htm Best IT certifications]
*[https://www.silkhom.com/barometre-2021-des-tjm-dans-informatique-digital/ FREELANCE]
*[http://www.journaldunet.com/solutions/emploi-rh/salaire-dans-l-informatique-hays/ IT]
== SRE ==
== SRE ==
* [https://openapm.io SRE]
* [https://openapm.io SRE]
== REDHAT package browser ==
 
* [https://access.redhat.com/downloads/content/package-browser REDHAT package browser]
== REDHAT Package Browser ==
* [https://access.redhat.com/downloads/content/package-browser REDHAT Package Browser]

Latest revision as of 14:48, 23 November 2024

Infocepo-illustration.jpg

Discover cloud computing on infocepo.com:

  • Master cloud infrastructure
  • Explore AI
  • Compare Kubernetes and AWS
  • Advance your IT skills with hands-on labs and open-source software.

Start your journey to expertise.


AI Tools

DEV

(28/08/2024)

INTERESTING LLMs

(23/11/2024)

Model Comment
parse gemma2-simpo
RAG gemma2-simpo
RAG-FR qwen2.5
code gemma2-27b, $$
code-completion deepseek-coder:base
summary qwen2.5
ai-translate gemma2, temperature 0
chat-leger 0.000055 euros/token, gemma2-simpo
chat-lourd 0.00015 euros/token, gemma2-27b, $$
mannix/gemma2-9b-simpo OllamaFunctions

NEWS

(04/05/2024)

TRAINING

CLOUD LAB

Infocepo.drawio.png

Presenting my LAB project.

CLOUD Audit

Created ServerDiff.sh for server audits. Enables configuration drift tracking and environment consistency checks.

CLOUD Migration Example

Diagram-migration-ORACLE-KVM-v2.drawio.png

  • 1.5d: Infrastructure audit of 82 services (ServerDiff.sh)
  • 1.5d: Create cloud architecture diagram.
  • 1.5d: Compliance check of 2 clouds (6 hypervisors, 6TB memory).
  • 1d: Cloud installations.
  • 0.5d: Stability check.
ACTION RESULT OK/KO
Activate maintenance for n/2-1 nodes or 1 node if 2 nodes. All resources are started.
Un-maintenance all nodes. Power off n/2-1 nodes or 1 node if 2 nodes, different from the previous test. All resources are started.
Power off all nodes simultaneously. Power on all nodes simultaneously. All resources are started.
  • 1.5d: Cloud automation study.
  • 1.5d: Develop 6 templates (2 clouds, 2 OS, 8 environments, 2 versions).
  • 1d: Create migration diagram.
  • 1.5d: Write 138 lines of migration code (MigrationApp.sh).
  • 1.5d: Process stabilization.
  • 1.5d: Cloud vs. old infrastructure benchmark.
  • 0.5d: Unavailability time calibration per migration unit.
  • 5 min: Load 82 VMs (env, OS, application code, 2 IPs).

Total = 15 man-days.

WEB Enhancement

WebModelDiagram.drawio.png

  • Formalize infrastructure for flexibility and reduced complexity.
  • Utilize customer-location tracking name server like GDNS.
  • Use minimal instances with a network load balancer like LVS.
  • Compare prices of dynamic computing services, beware of tech lock-in.
  • Employ efficient frontend TLS decoder like HAPROXY.
  • Opt for fast HTTP cache like VARNISH and Apache Traffic Server for large files.
  • Use PROXY with TLS decoder like ENVOY for service compatibility.
  • Consider serverless services for standard runtimes, mindful of potential incompatibilities.
  • Employ load balancing or native services for dynamic computing power.
  • Use open-source STACKs where possible.
  • Employ database caches like MEMCACHED.
  • Use queues for long batches.
  • Use buffers for stability of real streams.
  • More information at CLOUD WIKIPEDIA and GITHUB.

CLOUD WIKIPEDIA

CLOUD vs HW

Function Kubernetes OpenStack AWS Bare-metal HPC CRM oVirt
Deployment Tools
(Tools used for deployment)
Helm, YAML, Operator, Ansible, Juju, ArgoCD Ansible, Packer, Terraform, Juju Ansible, Terraform, CloudFormation, Juju Ansible, Shell Scripts xCAT, Clush Ansible, Shell Scripts Ansible, Python, Shell Scripts
Bootstrap Method
(Initial configuration and setup)
API API, PXE API PXE, IPMI PXE, IPMI PXE, IPMI PXE, API
Router Control
(Routing services)
API (Kube-router) API (Router/Subnet) API (Route Table/Subnet) Linux, OVS, External Hardware xCAT, External Hardware Linux, External Hardware API
Firewall Control
(Firewall rules and policies)
Ingress, Egress, Istio, NetworkPolicy API (Security Groups) API (Security Group) Linux Firewall Linux Firewall Linux Firewall API
Network Virtualization
(VLAN/VxLAN technologies)
Multiple Options VPC VPC OVS, Linux, External Hardware xCAT, External Hardware Linux, External Hardware API
Name Server Control
(DNS services)
CoreDNS DNS-Nameserver Amazon Route 53 GDNS xCAT Linux, External Hardware API, External Hardware
Load Balancer
(Load balancing options)
Kube-proxy, LVS (IPVS) LVS Network Load Balancer LVS SLURM Ldirectord N/A
Storage Options
(Available storage technologies)
Multiple Options Swift, Cinder, Nova S3, EFS, FSx, EBS Swift, XFS, EXT4, RAID10 GPFS SAN NFS, SAN

CLOUD providers

CLOUD INTERNET NETWORK

CLOUD NATIVE

DSO-POC-V3.drawio.png

High Availability (HA) with Corosync+Pacemaker

HA-REF.drawio.png

Typical Architecture

  • Dual-room.
  • IPMI LAN (fencing).
  • NTP, DNS+DHCP+PXE+TFTP+HTTP (auto-provisioning), PROXY (updates or internal REPOSITORY).
  • Choose 2+ node clusters.
  • For 2-node, require COROSYNC 2-node config, 10-second staggered closing for stability. For better stability, choose 3+ nodes architecture.
  • Allocate 4GB/base for DB resources. CPU resource requirements are generally low.

Typical Service Pattern

  • Multipath
  • LUN
  • LVM (LVM resource)
  • FS (FS resource)
  • NFS (FS resource)
  • User
  • IP (IP resource)
  • DNS name
  • Process (Process resource)
  • Listener (Listener resource)

HPC

HPC.drawio.png

IT Wage

SRE

REDHAT Package Browser