Main Page: Difference between revisions

Latest revision as of 14:48, 23 November 2024

Discover cloud computing on infocepo.com:

Master cloud infrastructure
Explore AI
Compare Kubernetes and AWS
Advance your IT skills with hands-on labs and open-source software.

Start your journey to expertise.

AI Tools

ChatGPT4 - Public assistant with learning abilities.
open-webui + GPU H100 + Ollama - Private assistant and API.
Private summary

DEV

(28/08/2024)

LLM Trending
Project Trending
LLM Ranking
ChatBot Evaluate
Perplexity AI - R&D
Models Trending
LLM Fine Tuning
Embeddings Ranking
Vectors DB Ranking
NVIDIA H100 - KUBERNETES or HPC clusters for DATASCIENCE.
NVIDIA 4080 - GPU card for a private assistant.
Img2txt Trending
Txt2img Evaluate
Chatchat - Private assistant with RAG capabilities in Chinese.
HPC Efficiency

INTERESTING LLMs

(23/11/2024)

Model	Comment
parse	gemma2-simpo
RAG	gemma2-simpo
RAG-FR	qwen2.5
code	gemma2-27b, $$
code-completion	deepseek-coder:base
summary	qwen2.5
ai-translate	gemma2, temperature 0
chat-leger	0.000055 euros/token, gemma2-simpo
chat-lourd	0.00015 euros/token, gemma2-27b, $$
mannix/gemma2-9b-simpo	OllamaFunctions
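
The per-token prices in the table can be turned into a rough budget estimate. The sketch below is only an illustration: it assumes billing is linear in the token count and makes no distinction between input and output tokens, which the table does not state. The function name `estimate_cost_eur` is hypothetical.

```python
from decimal import Decimal

# Per-token prices copied from the table above (euros per token).
PRICE_PER_TOKEN_EUR = {
    "chat-leger": Decimal("0.000055"),
    "chat-lourd": Decimal("0.00015"),
}


def estimate_cost_eur(profile: str, tokens: int) -> Decimal:
    """Estimate the cost in euros of `tokens` tokens for a given profile.

    Assumption: cost grows linearly with the number of tokens.
    """
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    return PRICE_PER_TOKEN_EUR[profile] * tokens


if __name__ == "__main__":
    for profile in PRICE_PER_TOKEN_EUR:
        cost = estimate_cost_eur(profile, 1_000_000)
        print(f"{profile}: {cost} EUR per million tokens")
```

`Decimal` is used instead of `float` so that very small per-token prices do not pick up binary rounding noise.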

NEWS

(04/05/2024)

Very good AI News
Real-time transcription with Diart makes it possible to follow the individual speakers.
Translation tools like Google Translate are becoming popular.
10x LLM acceleration at lower cost with GROQ.
Opensearch with LLM

TRAINING

TRANSFORMERS ALGORITHM

CLOUD LAB

Presenting my LAB project.

CLOUD Audit

ServerDiff.sh is a script created for server audits. It enables configuration drift tracking and environment consistency checks.
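
To illustrate what configuration drift tracking means in practice, here is a minimal sketch that compares two configuration snapshots. It is not ServerDiff.sh and does not reflect how that script works; the function name `config_drift` and the sample settings are hypothetical.

```python
import difflib


def config_drift(reference: str, candidate: str,
                 ref_name: str = "reference",
                 cand_name: str = "candidate") -> list[str]:
    """Return unified-diff lines showing how `candidate` differs from `reference`.

    An empty list means the two snapshots are identical.
    """
    return list(difflib.unified_diff(
        reference.splitlines(),
        candidate.splitlines(),
        fromfile=ref_name,
        tofile=cand_name,
        lineterm="",
    ))


if __name__ == "__main__":
    # Hypothetical snapshots of two servers that should be identical.
    prod = "kernel=5.14\nntp=enabled\nselinux=enforcing"
    staging = "kernel=5.14\nntp=disabled\nselinux=enforcing"
    for line in config_drift(prod, staging, "prod", "staging"):
        print(line)
```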

CLOUD Migration Example

1.5d: Infrastructure audit of 82 services (ServerDiff.sh)
1.5d: Create cloud architecture diagram.
1.5d: Compliance check of 2 clouds (6 hypervisors, 6TB memory).
1d: Cloud installations.
0.5d: Stability check.

ACTION	RESULT	OK/KO
Activate maintenance for n/2-1 nodes or 1 node if 2 nodes.	All resources are started.
Take all nodes out of maintenance. Power off n/2-1 nodes (or 1 node if there are 2 nodes), different from those in the previous test.	All resources are started.
Power off all nodes simultaneously. Power on all nodes simultaneously.	All resources are started.
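
The "n/2-1 nodes or 1 node if 2 nodes" rule in the table can be written as a small helper. This is a sketch under one assumption: the source does not say how to round n/2 for an odd number of nodes, so the code rounds up, which keeps a majority of nodes running. The function name `nodes_to_disable` is hypothetical.

```python
import math


def nodes_to_disable(n: int) -> int:
    """Number of nodes to put in maintenance (or power off) for the test.

    Follows the rule "n/2-1 nodes, or 1 node if 2 nodes".
    Assumption: for odd n, n/2 is rounded up before subtracting 1.
    """
    if n < 2:
        raise ValueError("the test needs a cluster of at least 2 nodes")
    if n == 2:
        return 1
    return math.ceil(n / 2) - 1


if __name__ == "__main__":
    for size in (2, 3, 4, 6):
        print(f"{size} nodes -> disable {nodes_to_disable(size)}")
```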

1.5d: Cloud automation study.
1.5d: Develop 6 templates (2 clouds, 2 OS, 8 environments, 2 versions).
1d: Create migration diagram.
1.5d: Write 138 lines of migration code (MigrationApp.sh).
1.5d: Process stabilization.
1.5d: Cloud vs. old infrastructure benchmark.
0.5d: Downtime calibration per migration unit.
5 min: Load 82 VMs (env, OS, application code, 2 IPs).

Total = 15 man-days.
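
The total can be checked by summing the step durations listed above. The sketch below copies the durations from the list and leaves out the 5-minute VM load step, which is negligible at man-day scale. The step labels are shortened and the helper name `total_man_days` is hypothetical.

```python
from fractions import Fraction

# (step, duration in man-days) copied from the list above.
STEPS = [
    ("Infrastructure audit", "1.5"),
    ("Cloud architecture diagram", "1.5"),
    ("Compliance check", "1.5"),
    ("Cloud installations", "1"),
    ("Stability check", "0.5"),
    ("Cloud automation study", "1.5"),
    ("Templates", "1.5"),
    ("Migration diagram", "1"),
    ("Migration code", "1.5"),
    ("Process stabilization", "1.5"),
    ("Benchmark", "1.5"),
    ("Unavailability time calibration", "0.5"),
]


def total_man_days(steps) -> Fraction:
    """Sum the durations exactly, avoiding float rounding."""
    return sum((Fraction(days) for _, days in steps), Fraction(0))


if __name__ == "__main__":
    print(f"Total = {total_man_days(STEPS)} man-days")
```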

WEB Enhancement

Formalize the infrastructure for flexibility and reduced complexity.
Use a name server that tracks customer location, such as GDNS.
Use minimal instances behind a network load balancer such as LVS.
Compare prices of dynamic computing services, and beware of technology lock-in.
Use an efficient frontend TLS decoder such as HAPROXY.
Choose a fast HTTP cache such as VARNISH, and Apache Traffic Server for large files.
Use a PROXY with a TLS decoder, such as ENVOY, for service compatibility.
Consider serverless services for standard runtimes, but watch for potential incompatibilities.
Use load balancing or native services for dynamic computing power.
Use open-source STACKs where possible.
Use database caches such as MEMCACHED.
Use queues for long batches.
Use buffers to keep real-time streams stable.
More information at CLOUD WIKIPEDIA and GITHUB.
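
The database cache recommendation above is commonly applied with the cache-aside pattern. The sketch below shows the idea only: a plain dictionary stands in for MEMCACHED, and `load_from_db` is a hypothetical callable representing the database query. It does not use any real Memcached client API.

```python
from typing import Callable, Dict


class CacheAside:
    """Cache-aside lookup: read the cache first, fall back to the database."""

    def __init__(self, load_from_db: Callable[[str], str]) -> None:
        self._load = load_from_db
        self._cache: Dict[str, str] = {}  # stand-in for a MEMCACHED client
        self.db_reads = 0                 # counts how often the database is hit

    def get(self, key: str) -> str:
        if key in self._cache:
            return self._cache[key]       # cache hit: no database access
        value = self._load(key)           # cache miss: query the database
        self.db_reads += 1
        self._cache[key] = value          # populate the cache for next time
        return value

    def invalidate(self, key: str) -> None:
        """Drop a key after the underlying row changes."""
        self._cache.pop(key, None)


if __name__ == "__main__":
    cache = CacheAside(lambda key: f"row-for-{key}")
    cache.get("user:1")
    cache.get("user:1")
    print("database reads:", cache.db_reads)
```

Invalidating on write keeps the sketch simple; a real deployment would also need an expiry policy, which is outside the scope of this example.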

CLOUD WIKIPEDIA

CLOUD WIKIPEDIA

CLOUD vs HW

Function	Kubernetes	OpenStack	AWS	Bare-metal	HPC	CRM	oVirt
Deployment Tools (Tools used for deployment)	Helm, YAML, Operator, Ansible, Juju, ArgoCD	Ansible, Packer, Terraform, Juju	Ansible, Terraform, CloudFormation, Juju	Ansible, Shell Scripts	xCAT, Clush	Ansible, Shell Scripts	Ansible, Python, Shell Scripts
Bootstrap Method (Initial configuration and setup)	API	API, PXE	API	PXE, IPMI	PXE, IPMI	PXE, IPMI	PXE, API
Router Control (Routing services)	API (Kube-router)	API (Router/Subnet)	API (Route Table/Subnet)	Linux, OVS, External Hardware	xCAT, External Hardware	Linux, External Hardware	API
Firewall Control (Firewall rules and policies)	Ingress, Egress, Istio, NetworkPolicy	API (Security Groups)	API (Security Group)	Linux Firewall	Linux Firewall	Linux Firewall	API
Network Virtualization (VLAN/VxLAN technologies)	Multiple Options	VPC	VPC	OVS, Linux, External Hardware	xCAT, External Hardware	Linux, External Hardware	API
Name Server Control (DNS services)	CoreDNS	DNS-Nameserver	Amazon Route 53	GDNS	xCAT	Linux, External Hardware	API, External Hardware
Load Balancer (Load balancing options)	Kube-proxy, LVS (IPVS)	LVS	Network Load Balancer	LVS	SLURM	Ldirectord	N/A
Storage Options (Available storage technologies)	Multiple Options	Swift, Cinder, Nova	S3, EFS, FSx, EBS	Swift, XFS, EXT4, RAID10	GPFS	SAN	NFS, SAN

CLOUD providers

CLOUD providers

CLOUD INTERNET NETWORK

CLOUD INTERNET NETWORK

CLOUD NATIVE

OFFICIAL STACKS
DevSecOps:

High Availability (HA) with Corosync+Pacemaker

Typical Architecture

Dual-room.
IPMI LAN (fencing).
NTP, DNS+DHCP+PXE+TFTP+HTTP (auto-provisioning), PROXY (updates or internal REPOSITORY).
Use clusters of 2 or more nodes.
A 2-node cluster requires the COROSYNC 2-node configuration and a 10-second staggered closing for stability. For better stability, choose an architecture with 3 or more nodes.
Allocate 4 GB per database for DB resources. CPU resource requirements are generally low.
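
The preference for 3 or more nodes follows from majority quorum arithmetic. The sketch below assumes one vote per node and models the Corosync `two_node` option as lowering the quorum to 1; the function names are hypothetical and the code does not talk to a real cluster.

```python
def quorum_votes(nodes: int, two_node: bool = False) -> int:
    """Votes needed for quorum, assuming one vote per node."""
    if nodes < 1:
        raise ValueError("a cluster needs at least one node")
    if two_node:
        if nodes != 2:
            raise ValueError("two_node mode only applies to 2-node clusters")
        return 1  # special case: a single surviving node keeps quorum
    return nodes // 2 + 1  # strict majority


def tolerated_failures(nodes: int, two_node: bool = False) -> int:
    """How many nodes can be lost while the rest still hold quorum."""
    return nodes - quorum_votes(nodes, two_node)


if __name__ == "__main__":
    print("2 nodes, plain majority:", tolerated_failures(2))
    print("2 nodes, two_node mode: ", tolerated_failures(2, two_node=True))
    print("3 nodes:                ", tolerated_failures(3))
```

With a plain majority a 2-node cluster tolerates no failure, which is why the 2-node case needs a dedicated configuration and relies on fencing, while 3 nodes tolerate one failure without special handling.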

Typical Service Pattern

Multipath
LUN
LVM (LVM resource)
FS (FS resource)
NFS (FS resource)
User
IP (IP resource)
DNS name
Process (Process resource)
Listener (Listener resource)
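
The list above reads as a layered stack, from storage up to the network listener. Assuming the listed order is also the dependency order (the source does not say so explicitly), the layers would be started top to bottom and stopped in reverse, which is how an ordered Pacemaker resource group behaves. The helper names below are hypothetical.

```python
# Layers copied from the list above, in the order given.
SERVICE_STACK = [
    "Multipath", "LUN", "LVM", "FS", "NFS",
    "User", "IP", "DNS name", "Process", "Listener",
]


def start_order(stack: list[str]) -> list[str]:
    """Start each layer after the one it depends on (assumed: listed order)."""
    return list(stack)


def stop_order(stack: list[str]) -> list[str]:
    """Stop in reverse so no layer loses a dependency while still running."""
    return list(reversed(stack))


if __name__ == "__main__":
    print("start:", " -> ".join(start_order(SERVICE_STACK)))
    print("stop: ", " -> ".join(stop_order(SERVICE_STACK)))
```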

@@ Line 10: / Line 10: @@
 <br>
 == AI Tools ==
-*[https://chat.openai.com ChatGPT4] - public assistant with learning abilities.
+* [https://chat.openai.com ChatGPT4] - Public assistant with learning abilities.
-*[https://github.com/open-webui/open-webui open-webui] + [https://www.scaleway.com/en/h100-pcie-try-it-now/ GPU H100] + [https://ollama.com Ollama] - private assistant and API.
+* [https://github.com/open-webui/open-webui open-webui] + [https://www.scaleway.com/en/h100-pcie-try-it-now/ GPU H100] + [https://ollama.com Ollama] - Private assistant and API.
-*[https://github.com/ynotopec/summarize private summary]
+* [https://github.com/ynotopec/summarize Private summary]
 === DEV ===
-(22/03/2024)
+(28/08/2024)
-*[https://github.com/hiyouga/LLaMA-Factory LLM Fine Tuning]
+* [https://ollama.com/library LLM Trending]
-*[https://huggingface.co/models Models Trending]
+* [https://github.com/search?q=stars%3A%3E15000+forks%3A%3E1500+created%3A%3E2022-06-01&type=repositories&s=updated&o=desc Project Trending]
-*[https://github.com/trending Project Trending]
+* [https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard LLM Ranking]
-*[https://chat.lmsys.org ChatBot Evaluate]
+* [https://chat.lmsys.org ChatBot Evaluate]
-*[https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard LLM Ranking]
+* [https://www.perplexity.ai Perplexity AI] - R&D
-*[https://huggingface.co/spaces/mteb/leaderboard Embeddings Ranking]
+* [https://huggingface.co/models Models Trending]
-*[https://huggingface.co/spaces/TIGER-Lab/GenAI-Arena Image Evaluate]
+* [https://github.com/hiyouga/LLaMA-Factory LLM Fine Tuning]
-*[https://www.perplexity.ai Perplexity AI] - R&D
+* [https://huggingface.co/spaces/mteb/leaderboard Embeddings Ranking]
-*[https://github.com/THUDM/CogVLM CogVLM] - Private API for multimodal purposes. Usable with RAG.
+* [https://ann-benchmarks.com Vectors DB Ranking]
-*[https://ann-benchmarks.com Vectors DB Ranking]
+* [https://www.nvidia.com/en-us/data-center/h100/ NVIDIA H100] - KUBERNETES or HPC clusters for DATASCIENCE.
-*[https://github.com/chatchat-space/Langchain-Chatchat Chatchat] - private assistant with RAG capabilities but Chinese language.
+* [https://www.nvidia.com/fr-fr/geforce/graphics-cards/40-series/rtx-4080-family NVIDIA 4080] - GPU card for private assistance.
-*[https://www.nvidia.com/en-us/data-center/h200 NVIDIA H200] - KUBERNETES or HPC clusters for DATASCIENCE.
+* [https://huggingface.co/models?pipeline_tag=image-text-to-text&sort=trending Img2txt Trending]
-*[https://www.nvidia.com/fr-fr/geforce/graphics-cards/40-series/rtx-4080-family NVIDIA 4080] - GPU card for private assistance.
+* [https://huggingface.co/spaces/TIGER-Lab/GenAI-Arena Txt2img Evaluate]
+* [https://github.com/chatchat-space/Langchain-Chatchat Chatchat] - Private assistant with RAG capabilities in Chinese.
+* [https://top500.org/lists/green500/ HPC Efficiency]
 ==== INTERESTING LLMs ====
-(22/03/2024)
+(23/11/2024)
-* Vicuna-33B (private assistant)
+{| class="wikitable"
-* Qwen-14B (32k, RAG)
+! Model
-* Vicuna-7B (summary)
+! Comment
+|-
+| '''parse'''
+| gemma2-simpo
+|-
+| '''RAG'''
+| gemma2-simpo
+|-
+| '''RAG-FR'''
+| qwen2.5
+|-
+| '''code'''
+| gemma2-27b, $$
+|-
+| '''code-completion'''
+| deepseek-coder:base
+|-
+| '''summary'''
+| qwen2.5
+|-
+| '''ai-translate'''
+| gemma2, temperature 0
+|-
+| '''chat-leger'''
+| 0.000055 euros/token, gemma2-simpo
+|-
+| '''chat-lourd'''
+| 0.00015 euros/token, gemma2-27b, $$
+|-
+| '''mannix/gemma2-9b-simpo'''
+| OllamaFunctions
+|}
 === NEWS ===
-(07/04/2024)
+(04/05/2024)
 * [https://www.youtube.com/@lev-selector/videos Very good AI News]
-* LLM + VISION [https://huggingface.co/deepseek-ai/deepseek-vl-7b-chat deepseek-ai/deepseek-vl-7b-chat]
+* For the [https://betterprogramming.pub/color-your-captions-streamlining-live-transcriptions-with-diart-and-openais-whisper-6203350234ef '''transcription'''] in real time with Diart, it is possible to follow the interlocutors.
-* LLM [https://huggingface.co/01-ai/Yi-34B-200K Yi-34B 200k] for long context available
+* [https://github.com/openai-translator/openai-translator Translation] tools like Google Translate are becoming popular.
-* Small vision language model [https://huggingface.co/vikhyatk/moondream2 moondream2] for embedded systems. Not yet available under Ollama
+* [https://www.mouser.fr/ProductDetail/BittWare/RS-GQ-GC1-0109?qs=ST9lo4GX8V2eGrFMeVQmFw%3D%3D '''LLM 10x accelerator'''] and cheaper with GROQ.
-* For the [https://betterprogramming.pub/color-your-captions-streamlining-live-transcriptions-with-diart-and-openais-whisper-6203350234ef '''transcription'''] real time with Diart it is possible to follow the interlocutors
-* [https://github.com/openai-translator/openai-translator translation] tools like Google translate are becoming popular
-* Claude 3 beats ChatGPT4? (with these [https://infocepo.com/wiki/index.php/Enigme Enigmes], no)
-* [https://www.mouser.fr/ProductDetail/BittWare/RS-GQ-GC1-0109?qs=ST9lo4GX8V2eGrFMeVQmFw%3D%3D '''LLM 10x accelerator'''] and cheaper with GROQ
 * [https://opensearch.org/docs/latest/search-plugins/conversational-search Opensearch with LLM]
-* ACCEL : vision IA chip very efficient and powerful.
-* IBM NorthPole : an IA chip very efficient and powerful.
 === TRAINING ===
-*[https://www.youtube.com/watch?v=4Bdc55j80l8 TRANSFORMERS ALGORITHM]
+* [https://www.youtube.com/watch?v=4Bdc55j80l8 TRANSFORMERS ALGORITHM]
-=== Cloud Native Install ===
-* [https://github.com/ynotopec/gpu-cluster GPU cluster]
-* [https://github.com/ynotopec/llm-k8s LLM API]
-[[File:AI-API.drawio.png]]
 == CLOUD LAB ==
-[[file:Infocepo.drawio.png]]
+[[File:Infocepo.drawio.png]]
 <br><br>
 Presenting my [[LAB project]].
@@ Line 67: / Line 90: @@
 == CLOUD Migration Example ==
 [[File:Diagram-migration-ORACLE-KVM-v2.drawio.png]]
-*1.5d: Infrastructure audit of 82 services ([https://infocepo.com/wiki/index.php/ServerDiff.sh ServerDiff.sh])
+* 1.5d: Infrastructure audit of 82 services ([https://infocepo.com/wiki/index.php/ServerDiff.sh ServerDiff.sh])
+* 1.5d: Create cloud architecture diagram.
+* 1.5d: Compliance check of 2 clouds (6 hypervisors, 6TB memory).
+* 1d: Cloud installations.
+* 0.5d: Stability check.
-*1.5d: Create cloud architecture diagram
-*1.5d: Compliance check of 2 clouds (6 hypervisors, 6TB memory)
-*1d: Cloud installations
-*.5d: Stability check
 {| style="border-spacing:0;width:18.12cm;"
 |- style="background-color:#ffc000;border:0.05pt solid #000000;padding:0.049cm;"
@@ Line 90: / Line 110: @@
 | style="background-color:#d8e4bc;border:0.05pt solid #000000;padding:0.049cm;color:#000000;" |
 |-
-| style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | Power off simultaneous all nodes. Power on simultaneous all nodes.
+| style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | Power off all nodes simultaneously. Power on all nodes simultaneously.
 | style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | All resources are started.
 | style="background-color:#d8e4bc;border:0.05pt solid #000000;padding:0.049cm;color:#000000;" |
 |-
 |}
-*1.5d: Cloud automation study
+* 1.5d: Cloud automation study.
+* 1.5d: Develop 6 templates (2 clouds, 2 OS, 8 environments, 2 versions).
-*1.5d: Develop 6 templates (2 clouds, 2 OS, 8 environments, 2 versions)
+* 1d: Create migration diagram.
+* 1.5d: Write 138 lines of migration code ([https://infocepo.com/wiki/index.php/MigrationApp.sh MigrationApp.sh]).
-*1d: Create migration diagram
+* 1.5d: Process stabilization.
+* 1.5d: Cloud vs. old infrastructure benchmark.
-*1.5d: Write 138 lines of migration code ([https://infocepo.com/wiki/index.php/MigrationApp.sh MigrationApp.sh])
+* 0.5d: Unavailability time calibration per migration unit.
+* 5 min: Load 82 VMs (env, OS, application code, 2 IPs).
-*1.5d: Process stabilization
-*1.5d: Cloud vs old infrastructure benchmark
-*.5d: Unavailability time calibration per migration unit
-*5min: Load 82 VMs (env, os, application_code, 2 IP)
- Total = 15 man-days
+Total = 15 man-days.
 == WEB Enhancement ==
@@ Line 123: / Line 136: @@
 * Opt for fast HTTP cache like VARNISH and Apache Traffic Server for large files.
 * Use PROXY with TLS decoder like ENVOY for service compatibility.
-* Consider serverless service for standard runtimes, mindful of potential incompatibilities.
+* Consider serverless services for standard runtimes, mindful of potential incompatibilities.
 * Employ load balancing or native services for dynamic computing power.
-* Use open source STACKs where possible.
+* Use open-source STACKs where possible.
 * Employ database caches like MEMCACHED.
-* Use queues for long batch.
+* Use queues for long batches.
 * Use buffers for stability of real streams.
 * More information at [https://wikitech.wikimedia.org/wiki/Wikimedia_infrastructure CLOUD WIKIPEDIA] and [https://github.com/systemdesign42/system-design GITHUB].
@@ Line 221: / Line 234: @@
 == CLOUD providers ==
 * [https://cloud.google.com/free/docs/aws-azure-gcp-service-comparison CLOUD providers]
 == CLOUD INTERNET NETWORK ==
 * [https://global-internet-map-2021.telegeography.com/ CLOUD INTERNET NETWORK]
 == CLOUD NATIVE ==
 * [https://landscape.cncf.io/?fullscreen=yes OFFICIAL STACKS]
@@ Line 233: / Line 248: @@
 === Typical Architecture ===
-*Dual-room.
+* Dual-room.
-*IPMI LAN (fencing).
+* IPMI LAN (fencing).
-*NTP, DNS+DHCP+PXE+TFTP+HTTP (auto-provisioning), PROXY (updates or internal REPOSITORY).
+* NTP, DNS+DHCP+PXE+TFTP+HTTP (auto-provisioning), PROXY (updates or internal REPOSITORY).
-*Choose 2+ node clusters.
+* Choose 2+ node clusters.
-*For 2-node, require COROSYNC 2-node config, 10-second staggered closing for stability. But for better stability choose 3+ nodes architecture.
+* For 2-node, require COROSYNC 2-node config, 10-second staggered closing for stability. For better stability, choose 3+ nodes architecture.
-*Allocate 4GB/base for DB resources. CPU resource requirements generally low.
+* Allocate 4GB/base for DB resources. CPU resource requirements are generally low.
 === Typical Service Pattern ===
-*Multipath
+* Multipath
-*LUN
+* LUN
-*LVM (LVM resource)
+* LVM (LVM resource)
-*FS (FS resource)
+* FS (FS resource)
-*NFS (FS resource)
+* NFS (FS resource)
-*User
+* User
-*IP (IP resource)
+* IP (IP resource)
-*DNS name
+* DNS name
-*Process (Process resource)
+* Process (Process resource)
-*Listener (Listener resource)
+* Listener (Listener resource)
 == HPC ==
 [[File:HPC.drawio.png]]
-== IT wage ==
-*[http://jobsearchtech.about.com/od/educationfortechcareers/tp/HighestCerts.htm Best IT certifications]
+== IT Wage ==
-*[https://www.silkhom.com/barometre-2021-des-tjm-dans-informatique-digital FREELANCE]
+* [http://jobsearchtech.about.com/od/educationfortechcareers/tp/HighestCerts.htm Best IT certifications]
-*[http://www.journaldunet.com/solutions/emploi-rh/salaire-dans-l-informatique-hays IT]
+* [https://www.silkhom.com/barometre-2021-des-tjm-dans-informatique-digital FREELANCE]
+* [http://www.journaldunet.com/solutions/emploi-rh/salaire-dans-l-informatique-hays IT]
 == SRE ==
 * [https://openapm.io SRE]
-== REDHAT package browser ==
-* [https://access.redhat.com/downloads/content/package-browser REDHAT package browser]
+== REDHAT Package Browser ==
+* [https://access.redhat.com/downloads/content/package-browser REDHAT Package Browser]
