Main Page: Difference between revisions
Jump to navigation
Jump to search
No edit summary |
|||
(409 intermediate revisions by 4 users not shown) | |||
Line 1: | Line 1: | ||
[[File:Infocepo-illustration.jpg|thumb|right]] | |||
'''Discover cloud computing on infocepo.com''': | |||
* Master cloud infrastructure | |||
* Explore AI | |||
* Compare Kubernetes and AWS | |||
* Advance your IT skills with hands-on labs and open-source software. | |||
Start your journey to expertise. | |||
* | <br> | ||
== AI Tools == | |||
* [https://chat.openai.com ChatGPT4] - Public assistant with learning abilities. | |||
* [https://github.com/open-webui/open-webui open-webui] + [https://www.scaleway.com/en/h100-pcie-try-it-now/ GPU H100] + [https://ollama.com Ollama] - Private assistant and API. | |||
* [https://github.com/ynotopec/summarize Private summary] | |||
* | === DEV === | ||
(28/08/2024) | |||
* [https://ollama.com/library LLM Trending] | |||
* [https://github.com/search?q=stars%3A%3E15000+forks%3A%3E1500+created%3A%3E2022-06-01&type=repositories&s=updated&o=desc Project Trending] | |||
* [https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard LLM Ranking] | |||
* [https://chat.lmsys.org ChatBot Evaluate] | |||
* [https://www.perplexity.ai Perplexity AI] - R&D | |||
* [https://huggingface.co/models Models Trending] | |||
* [https://github.com/hiyouga/LLaMA-Factory LLM Fine Tuning] | |||
* [https://huggingface.co/spaces/mteb/leaderboard Embeddings Ranking] | |||
* [https://ann-benchmarks.com Vectors DB Ranking] | |||
* [https://www.nvidia.com/en-us/data-center/h100/ NVIDIA H100] - KUBERNETES or HPC clusters for DATASCIENCE. | |||
* [https://www.nvidia.com/fr-fr/geforce/graphics-cards/40-series/rtx-4080-family NVIDIA 4080] - GPU card for private assistance. | |||
* [https://huggingface.co/models?pipeline_tag=image-text-to-text&sort=trending Img2txt Trending] | |||
* [https://huggingface.co/spaces/TIGER-Lab/GenAI-Arena Txt2img Evaluate] | |||
* [https://github.com/chatchat-space/Langchain-Chatchat Chatchat] - Private assistant with RAG capabilities in Chinese. | |||
* [https://top500.org/lists/green500/ HPC Efficiency] | |||
==== INTERESTING LLMs ==== | |||
(23/11/2024) | |||
{| class="wikitable" | |||
! Model | |||
! Comment | |||
|- | |||
| '''parse''' | |||
| gemma2-simpo | |||
|- | |||
| '''RAG''' | |||
| gemma2-simpo | |||
|- | |||
| '''RAG-FR''' | |||
| qwen2.5 | |||
|- | |||
| '''code''' | |||
| gemma2-27b, $$ | |||
|- | |||
| '''code-completion''' | |||
| deepseek-coder:base | |||
|- | |||
| '''summary''' | |||
| qwen2.5 | |||
|- | |||
| '''ai-translate''' | |||
| gemma2, temperature 0 | |||
|- | |||
| '''chat-leger''' | |||
| 0.000055 euros/token, gemma2-simpo | |||
|- | |||
| '''chat-lourd''' | |||
| 0.00015 euros/token, gemma2-27b, $$ | |||
|- | |||
| '''mannix/gemma2-9b-simpo''' | |||
| OllamaFunctions | |||
|} | |||
=== NEWS === | |||
(04/05/2024) | |||
* [https://www.youtube.com/@lev-selector/videos Very good AI News] | |||
* For the [https://betterprogramming.pub/color-your-captions-streamlining-live-transcriptions-with-diart-and-openais-whisper-6203350234ef '''transcription'''] in real time with Diart, it is possible to follow the interlocutors. | |||
* [https://github.com/openai-translator/openai-translator Translation] tools like Google Translate are becoming popular. | |||
* [https://www.mouser.fr/ProductDetail/BittWare/RS-GQ-GC1-0109?qs=ST9lo4GX8V2eGrFMeVQmFw%3D%3D '''LLM 10x accelerator'''] and cheaper with GROQ. | |||
* [https://opensearch.org/docs/latest/search-plugins/conversational-search Opensearch with LLM] | |||
* | === TRAINING === | ||
* [https://www.youtube.com/watch?v=4Bdc55j80l8 TRANSFORMERS ALGORITHM] | |||
== CLOUD LAB == | |||
[[File:Infocepo.drawio.png]] | |||
<br><br> | |||
Presenting my [[LAB project]]. | |||
== CLOUD Audit == | |||
Created [[ServerDiff.sh]] for server audits. Enables configuration drift tracking and environment consistency checks. | |||
* | == CLOUD Migration Example == | ||
[[File:Diagram-migration-ORACLE-KVM-v2.drawio.png]] | |||
* 1.5d: Infrastructure audit of 82 services ([https://infocepo.com/wiki/index.php/ServerDiff.sh ServerDiff.sh]) | |||
* 1.5d: Create cloud architecture diagram. | |||
* 1.5d: Compliance check of 2 clouds (6 hypervisors, 6TB memory). | |||
* 1d: Cloud installations. | |||
* 0.5d: Stability check. | |||
* | {| style="border-spacing:0;width:18.12cm;" | ||
|- style="background-color:#ffc000;border:0.05pt solid #000000;padding:0.049cm;" | |||
| align=center style="color:#000000;" | '''ACTION''' | |||
| align=center style="color:#000000;" | '''RESULT''' | |||
| align=center style="color:#000000;" | '''OK/KO''' | |||
|- | |||
| style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | Activate maintenance for n/2-1 nodes or 1 node if 2 nodes. | |||
| style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | All resources are started. | |||
| style="background-color:#d8e4bc;border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | | |||
|- | |||
| style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | Un-maintenance all nodes. Power off n/2-1 nodes or 1 node if 2 nodes, different from the previous test. | |||
| style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | All resources are started. | |||
| style="background-color:#d8e4bc;border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | | |||
|- | |||
| style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | Power off all nodes simultaneously. Power on all nodes simultaneously. | |||
| style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | All resources are started. | |||
| style="background-color:#d8e4bc;border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | | |||
|- | |||
|} | |||
* 1.5d: Cloud automation study. | |||
* 1.5d: Develop 6 templates (2 clouds, 2 OS, 8 environments, 2 versions). | |||
* 1d: Create migration diagram. | |||
* 1.5d: Write 138 lines of migration code ([https://infocepo.com/wiki/index.php/MigrationApp.sh MigrationApp.sh]). | |||
* 1.5d: Process stabilization. | |||
* 1.5d: Cloud vs. old infrastructure benchmark. | |||
* 0.5d: Unavailability time calibration per migration unit. | |||
* 5 min: Load 82 VMs (env, OS, application code, 2 IPs). | |||
Total = 15 man-days. | |||
== WEB Enhancement == | |||
[[File:WebModelDiagram.drawio.png]] | |||
* Formalize infrastructure for flexibility and reduced complexity. | |||
* Utilize customer-location tracking name server like GDNS. | |||
* Use minimal instances with a network load balancer like LVS. | |||
* Compare prices of dynamic computing services, beware of tech lock-in. | |||
* Employ efficient frontend TLS decoder like HAPROXY. | |||
* Opt for fast HTTP cache like VARNISH and Apache Traffic Server for large files. | |||
* Use PROXY with TLS decoder like ENVOY for service compatibility. | |||
* Consider serverless services for standard runtimes, mindful of potential incompatibilities. | |||
* Employ load balancing or native services for dynamic computing power. | |||
* Use open-source STACKs where possible. | |||
* Employ database caches like MEMCACHED. | |||
* Use queues for long batches. | |||
* Use buffers for stability of real streams. | |||
* More information at [https://wikitech.wikimedia.org/wiki/Wikimedia_infrastructure CLOUD WIKIPEDIA] and [https://github.com/systemdesign42/system-design GITHUB]. | |||
* | == CLOUD WIKIPEDIA == | ||
* [https://wikitech.wikimedia.org/wiki/Wikimedia_infrastructure CLOUD WIKIPEDIA] | |||
== CLOUD vs HW == | |||
{| class="wikitable" | |||
|- | |||
! Function | |||
! Kubernetes | |||
! OpenStack | |||
! AWS | |||
! Bare-metal | |||
! HPC | |||
! CRM | |||
! oVirt | |||
|- | |||
| '''Deployment Tools'''<br>''(Tools used for deployment)'' | |||
| Helm, YAML, Operator, Ansible, Juju, ArgoCD | |||
| Ansible, Packer, Terraform, Juju | |||
| Ansible, Terraform, CloudFormation, Juju | |||
| Ansible, Shell Scripts | |||
| xCAT, Clush | |||
| Ansible, Shell Scripts | |||
| Ansible, Python, Shell Scripts | |||
|- | |||
| '''Bootstrap Method'''<br>''(Initial configuration and setup)'' | |||
| API | |||
| API, PXE | |||
| API | |||
| PXE, IPMI | |||
| PXE, IPMI | |||
| PXE, IPMI | |||
| PXE, API | |||
|- | |||
| '''Router Control'''<br>''(Routing services)'' | |||
| API (Kube-router) | |||
| API (Router/Subnet) | |||
| API (Route Table/Subnet) | |||
| Linux, OVS, External Hardware | |||
| xCAT, External Hardware | |||
| Linux, External Hardware | |||
| API | |||
|- | |||
| '''Firewall Control'''<br>''(Firewall rules and policies)'' | |||
| Ingress, Egress, Istio, NetworkPolicy | |||
| API (Security Groups) | |||
| API (Security Group) | |||
| Linux Firewall | |||
| Linux Firewall | |||
| Linux Firewall | |||
| API | |||
|- | |||
| '''Network Virtualization'''<br>''(VLAN/VxLAN technologies)'' | |||
| Multiple Options | |||
| VPC | |||
| VPC | |||
| OVS, Linux, External Hardware | |||
| xCAT, External Hardware | |||
| Linux, External Hardware | |||
| API | |||
|- | |||
| '''Name Server Control'''<br>''(DNS services)'' | |||
| CoreDNS | |||
| DNS-Nameserver | |||
| Amazon Route 53 | |||
| GDNS | |||
| xCAT | |||
| Linux, External Hardware | |||
| API, External Hardware | |||
|- | |||
| '''Load Balancer'''<br>''(Load balancing options)'' | |||
| Kube-proxy, LVS (IPVS) | |||
| LVS | |||
| Network Load Balancer | |||
| LVS | |||
| SLURM | |||
| Ldirectord | |||
| N/A | |||
|- | |||
| '''Storage Options'''<br>''(Available storage technologies)'' | |||
| Multiple Options | |||
| Swift, Cinder, Nova | |||
| S3, EFS, FSx, EBS | |||
| Swift, XFS, EXT4, RAID10 | |||
| GPFS | |||
| SAN | |||
| NFS, SAN | |||
|} | |||
* | == CLOUD providers == | ||
* [https://cloud.google.com/free/docs/aws-azure-gcp-service-comparison CLOUD providers] | |||
* | == CLOUD INTERNET NETWORK == | ||
* [https://global-internet-map-2021.telegeography.com/ CLOUD INTERNET NETWORK] | |||
* | == CLOUD NATIVE == | ||
* [https://landscape.cncf.io/?fullscreen=yes OFFICIAL STACKS] | |||
* DevSecOps : | |||
[[File:DSO-POC-V3.drawio.png]] | |||
== High Availability (HA) with Corosync+Pacemaker == | |||
[[File:HA-REF.drawio.png]] | |||
== | === Typical Architecture === | ||
* | * Dual-room. | ||
* | * IPMI LAN (fencing). | ||
* NTP, DNS+DHCP+PXE+TFTP+HTTP (auto-provisioning), PROXY (updates or internal REPOSITORY). | |||
* Choose 2+ node clusters. | |||
* For 2-node, require COROSYNC 2-node config, 10-second staggered closing for stability. For better stability, choose 3+ nodes architecture. | |||
* Allocate 4GB/base for DB resources. CPU resource requirements are generally low. | |||
== | === Typical Service Pattern === | ||
* Multipath | |||
* LUN | |||
* LVM (LVM resource) | |||
* FS (FS resource) | |||
* NFS (FS resource) | |||
* User | |||
* IP (IP resource) | |||
* DNS name | |||
* Process (Process resource) | |||
* Listener (Listener resource) | |||
== HPC == | |||
[[File:HPC.drawio.png]] | |||
== | == IT Wage == | ||
* [http://jobsearchtech.about.com/od/educationfortechcareers/tp/HighestCerts.htm Best IT certifications] | |||
* [https://www.silkhom.com/barometre-2021-des-tjm-dans-informatique-digital FREELANCE] | |||
* [http://www.journaldunet.com/solutions/emploi-rh/salaire-dans-l-informatique-hays IT] | |||
* [ | == SRE == | ||
* [https://openapm.io SRE] | |||
== REDHAT Package Browser == | |||
* [https://access.redhat.com/downloads/content/package-browser REDHAT Package Browser] |
Latest revision as of 14:48, 23 November 2024
Discover cloud computing on infocepo.com:
- Master cloud infrastructure
- Explore AI
- Compare Kubernetes and AWS
- Advance your IT skills with hands-on labs and open-source software.
Start your journey to expertise.
AI Tools
- ChatGPT4 - Public assistant with learning abilities.
- open-webui + GPU H100 + Ollama - Private assistant and API.
- Private summary
DEV
(28/08/2024)
- LLM Trending
- Project Trending
- LLM Ranking
- ChatBot Evaluate
- Perplexity AI - R&D
- Models Trending
- LLM Fine Tuning
- Embeddings Ranking
- Vectors DB Ranking
- NVIDIA H100 - KUBERNETES or HPC clusters for DATASCIENCE.
- NVIDIA 4080 - GPU card for private assistance.
- Img2txt Trending
- Txt2img Evaluate
- Chatchat - Private assistant with RAG capabilities in Chinese.
- HPC Efficiency
INTERESTING LLMs
(23/11/2024)
Model | Comment |
---|---|
parse | gemma2-simpo |
RAG | gemma2-simpo |
RAG-FR | qwen2.5 |
code | gemma2-27b, $$ |
code-completion | deepseek-coder:base |
summary | qwen2.5 |
ai-translate | gemma2, temperature 0 |
chat-leger | 0.000055 euros/token, gemma2-simpo |
chat-lourd | 0.00015 euros/token, gemma2-27b, $$ |
mannix/gemma2-9b-simpo | OllamaFunctions |
NEWS
(04/05/2024)
- Very good AI News
- For the transcription in real time with Diart, it is possible to follow the interlocutors.
- Translation tools like Google Translate are becoming popular.
- LLM 10x accelerator and cheaper with GROQ.
- Opensearch with LLM
TRAINING
CLOUD LAB
Presenting my LAB project.
CLOUD Audit
Created ServerDiff.sh for server audits. Enables configuration drift tracking and environment consistency checks.
CLOUD Migration Example
- 1.5d: Infrastructure audit of 82 services (ServerDiff.sh)
- 1.5d: Create cloud architecture diagram.
- 1.5d: Compliance check of 2 clouds (6 hypervisors, 6TB memory).
- 1d: Cloud installations.
- 0.5d: Stability check.
ACTION | RESULT | OK/KO |
Activate maintenance for n/2-1 nodes or 1 node if 2 nodes. | All resources are started. | |
Un-maintenance all nodes. Power off n/2-1 nodes or 1 node if 2 nodes, different from the previous test. | All resources are started. | |
Power off all nodes simultaneously. Power on all nodes simultaneously. | All resources are started. |
- 1.5d: Cloud automation study.
- 1.5d: Develop 6 templates (2 clouds, 2 OS, 8 environments, 2 versions).
- 1d: Create migration diagram.
- 1.5d: Write 138 lines of migration code (MigrationApp.sh).
- 1.5d: Process stabilization.
- 1.5d: Cloud vs. old infrastructure benchmark.
- 0.5d: Unavailability time calibration per migration unit.
- 5 min: Load 82 VMs (env, OS, application code, 2 IPs).
Total = 15 man-days.
WEB Enhancement
- Formalize infrastructure for flexibility and reduced complexity.
- Utilize customer-location tracking name server like GDNS.
- Use minimal instances with a network load balancer like LVS.
- Compare prices of dynamic computing services, beware of tech lock-in.
- Employ efficient frontend TLS decoder like HAPROXY.
- Opt for fast HTTP cache like VARNISH and Apache Traffic Server for large files.
- Use PROXY with TLS decoder like ENVOY for service compatibility.
- Consider serverless services for standard runtimes, mindful of potential incompatibilities.
- Employ load balancing or native services for dynamic computing power.
- Use open-source STACKs where possible.
- Employ database caches like MEMCACHED.
- Use queues for long batches.
- Use buffers for stability of real streams.
- More information at CLOUD WIKIPEDIA and GITHUB.
CLOUD WIKIPEDIA
CLOUD vs HW
Function | Kubernetes | OpenStack | AWS | Bare-metal | HPC | CRM | oVirt |
---|---|---|---|---|---|---|---|
Deployment Tools (Tools used for deployment) |
Helm, YAML, Operator, Ansible, Juju, ArgoCD | Ansible, Packer, Terraform, Juju | Ansible, Terraform, CloudFormation, Juju | Ansible, Shell Scripts | xCAT, Clush | Ansible, Shell Scripts | Ansible, Python, Shell Scripts |
Bootstrap Method (Initial configuration and setup) |
API | API, PXE | API | PXE, IPMI | PXE, IPMI | PXE, IPMI | PXE, API |
Router Control (Routing services) |
API (Kube-router) | API (Router/Subnet) | API (Route Table/Subnet) | Linux, OVS, External Hardware | xCAT, External Hardware | Linux, External Hardware | API |
Firewall Control (Firewall rules and policies) |
Ingress, Egress, Istio, NetworkPolicy | API (Security Groups) | API (Security Group) | Linux Firewall | Linux Firewall | Linux Firewall | API |
Network Virtualization (VLAN/VxLAN technologies) |
Multiple Options | VPC | VPC | OVS, Linux, External Hardware | xCAT, External Hardware | Linux, External Hardware | API |
Name Server Control (DNS services) |
CoreDNS | DNS-Nameserver | Amazon Route 53 | GDNS | xCAT | Linux, External Hardware | API, External Hardware |
Load Balancer (Load balancing options) |
Kube-proxy, LVS (IPVS) | LVS | Network Load Balancer | LVS | SLURM | Ldirectord | N/A |
Storage Options (Available storage technologies) |
Multiple Options | Swift, Cinder, Nova | S3, EFS, FSx, EBS | Swift, XFS, EXT4, RAID10 | GPFS | SAN | NFS, SAN |
CLOUD providers
CLOUD INTERNET NETWORK
CLOUD NATIVE
- OFFICIAL STACKS
- DevSecOps :
High Availability (HA) with Corosync+Pacemaker
Typical Architecture
- Dual-room.
- IPMI LAN (fencing).
- NTP, DNS+DHCP+PXE+TFTP+HTTP (auto-provisioning), PROXY (updates or internal REPOSITORY).
- Choose 2+ node clusters.
- For 2-node, require COROSYNC 2-node config, 10-second staggered closing for stability. For better stability, choose 3+ nodes architecture.
- Allocate 4GB/base for DB resources. CPU resource requirements are generally low.
Typical Service Pattern
- Multipath
- LUN
- LVM (LVM resource)
- FS (FS resource)
- NFS (FS resource)
- User
- IP (IP resource)
- DNS name
- Process (Process resource)
- Listener (Listener resource)