Main Page: Difference between revisions
Jump to navigation
Jump to search
(21 intermediate revisions by the same user not shown) | |||
Line 10: | Line 10: | ||
<br> | <br> | ||
== AI Tools == | == AI Tools == | ||
*[https://chat.openai.com ChatGPT4] - | * [https://chat.openai.com ChatGPT4] - Public assistant with learning abilities. | ||
*[https://github.com/open-webui/open-webui open-webui] + [https://www.scaleway.com/en/h100-pcie-try-it-now/ GPU H100] + [https://ollama.com Ollama] - | * [https://github.com/open-webui/open-webui open-webui] + [https://www.scaleway.com/en/h100-pcie-try-it-now/ GPU H100] + [https://ollama.com Ollama] - Private assistant and API. | ||
*[https://github.com/ynotopec/summarize | * [https://github.com/ynotopec/summarize Private summary] | ||
=== DEV === | === DEV === | ||
( | (28/08/2024) | ||
*[https:// | * [https://ollama.com/library LLM Trending] | ||
*[https:// | * [https://github.com/search?q=stars%3A%3E15000+forks%3A%3E1500+created%3A%3E2022-06-01&type=repositories&s=updated&o=desc Project Trending] | ||
*[https:// | * [https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard LLM Ranking] | ||
*[https://chat.lmsys.org ChatBot Evaluate] | * [https://chat.lmsys.org ChatBot Evaluate] | ||
*[https:// | * [https://www.perplexity.ai Perplexity AI] - R&D | ||
*[https://huggingface.co/ | * [https://huggingface.co/models Models Trending] | ||
*[https:// | * [https://github.com/hiyouga/LLaMA-Factory LLM Fine Tuning] | ||
*[https:// | * [https://huggingface.co/spaces/mteb/leaderboard Embeddings Ranking] | ||
* [https://ann-benchmarks.com Vectors DB Ranking] | |||
*[https://ann-benchmarks.com Vectors DB Ranking] | * [https://www.nvidia.com/en-us/data-center/h100/ NVIDIA H100] - KUBERNETES or HPC clusters for DATASCIENCE. | ||
* [https://www.nvidia.com/fr-fr/geforce/graphics-cards/40-series/rtx-4080-family NVIDIA 4080] - GPU card for private assistance. | |||
*[https://www.nvidia.com/en-us/data-center/ | * [https://huggingface.co/models?pipeline_tag=image-text-to-text&sort=trending Img2txt Trending] | ||
*[https://www.nvidia.com/fr-fr/geforce/graphics-cards/40-series/rtx-4080-family NVIDIA 4080] - GPU card for private assistance. | * [https://huggingface.co/spaces/TIGER-Lab/GenAI-Arena Txt2img Evaluate] | ||
* [https://github.com/chatchat-space/Langchain-Chatchat Chatchat] - Private assistant with RAG capabilities in Chinese. | |||
* [https://top500.org/lists/green500/ HPC Efficiency] | |||
==== INTERESTING LLMs ==== | ==== INTERESTING LLMs ==== | ||
( | (23/11/2024) | ||
{| class="wikitable" | |||
! Model | |||
! Comment | |||
|- | |||
| '''parse''' | |||
| gemma2-simpo | |||
|- | |||
| '''RAG''' | |||
| gemma2-simpo | |||
|- | |||
| '''RAG-FR''' | |||
| qwen2.5 | |||
|- | |||
| '''code''' | |||
| gemma2-27b, $$ | |||
|- | |||
| '''code-completion''' | |||
| deepseek-coder:base | |||
|- | |||
| '''summary''' | |||
| qwen2.5 | |||
|- | |||
| '''ai-translate''' | |||
| gemma2, temperature 0 | |||
|- | |||
| '''chat-leger''' | |||
| 0.000055 euros/token, gemma2-simpo | |||
|- | |||
| '''chat-lourd''' | |||
| 0.00015 euros/token, gemma2-27b, $$ | |||
|- | |||
| '''mannix/gemma2-9b-simpo''' | |||
| OllamaFunctions | |||
|} | |||
=== NEWS === | === NEWS === | ||
( | (04/05/2024) | ||
* [https://www.youtube.com/@lev-selector/videos Very good AI News] | * [https://www.youtube.com/@lev-selector/videos Very good AI News] | ||
* For the [https://betterprogramming.pub/color-your-captions-streamlining-live-transcriptions-with-diart-and-openais-whisper-6203350234ef '''transcription'''] in real time with Diart, it is possible to follow the interlocutors. | |||
* [https://github.com/openai-translator/openai-translator Translation] tools like Google Translate are becoming popular. | |||
* [https://www.mouser.fr/ProductDetail/BittWare/RS-GQ-GC1-0109?qs=ST9lo4GX8V2eGrFMeVQmFw%3D%3D '''LLM 10x accelerator'''] and cheaper with GROQ. | |||
* For the [https://betterprogramming.pub/color-your-captions-streamlining-live-transcriptions-with-diart-and-openais-whisper-6203350234ef '''transcription'''] real time with Diart it is possible to follow the interlocutors | |||
* [https://github.com/openai-translator/openai-translator | |||
* [https://www.mouser.fr/ProductDetail/BittWare/RS-GQ-GC1-0109?qs=ST9lo4GX8V2eGrFMeVQmFw%3D%3D '''LLM 10x accelerator'''] and cheaper with GROQ | |||
* [https://opensearch.org/docs/latest/search-plugins/conversational-search Opensearch with LLM] | * [https://opensearch.org/docs/latest/search-plugins/conversational-search Opensearch with LLM] | ||
=== TRAINING === | === TRAINING === | ||
*[https://www.youtube.com/watch?v=4Bdc55j80l8 TRANSFORMERS ALGORITHM | * [https://www.youtube.com/watch?v=4Bdc55j80l8 TRANSFORMERS ALGORITHM] | ||
== CLOUD LAB == | == CLOUD LAB == | ||
[[ | [[File:Infocepo.drawio.png]] | ||
<br><br> | <br><br> | ||
Presenting my [[LAB project]]. | Presenting my [[LAB project]]. | ||
Line 67: | Line 90: | ||
== CLOUD Migration Example == | == CLOUD Migration Example == | ||
[[File:Diagram-migration-ORACLE-KVM-v2.drawio.png]] | [[File:Diagram-migration-ORACLE-KVM-v2.drawio.png]] | ||
*1.5d: Infrastructure audit of 82 services ([https://infocepo.com/wiki/index.php/ServerDiff.sh ServerDiff.sh]) | * 1.5d: Infrastructure audit of 82 services ([https://infocepo.com/wiki/index.php/ServerDiff.sh ServerDiff.sh]) | ||
* 1.5d: Create cloud architecture diagram. | |||
* 1.5d: Compliance check of 2 clouds (6 hypervisors, 6TB memory). | |||
* 1d: Cloud installations. | |||
* 0.5d: Stability check. | |||
{| style="border-spacing:0;width:18.12cm;" | {| style="border-spacing:0;width:18.12cm;" | ||
|- style="background-color:#ffc000;border:0.05pt solid #000000;padding:0.049cm;" | |- style="background-color:#ffc000;border:0.05pt solid #000000;padding:0.049cm;" | ||
Line 90: | Line 110: | ||
| style="background-color:#d8e4bc;border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | | | style="background-color:#d8e4bc;border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | | ||
|- | |- | ||
| style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | Power off | | style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | Power off all nodes simultaneously. Power on all nodes simultaneously. | ||
| style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | All resources are started. | | style="border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | All resources are started. | ||
| style="background-color:#d8e4bc;border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | | | style="background-color:#d8e4bc;border:0.05pt solid #000000;padding:0.049cm;color:#000000;" | | ||
|- | |- | ||
|} | |} | ||
*1.5d: Cloud automation study | * 1.5d: Cloud automation study. | ||
* 1.5d: Develop 6 templates (2 clouds, 2 OS, 8 environments, 2 versions). | |||
*1.5d: Develop 6 templates (2 clouds, 2 OS, 8 environments, 2 versions) | * 1d: Create migration diagram. | ||
* 1.5d: Write 138 lines of migration code ([https://infocepo.com/wiki/index.php/MigrationApp.sh MigrationApp.sh]). | |||
*1d: Create migration diagram | * 1.5d: Process stabilization. | ||
* 1.5d: Cloud vs. old infrastructure benchmark. | |||
*1.5d: Write 138 lines of migration code ([https://infocepo.com/wiki/index.php/MigrationApp.sh MigrationApp.sh]) | * 0.5d: Unavailability time calibration per migration unit. | ||
* 5 min: Load 82 VMs (env, OS, application code, 2 IPs). | |||
*1.5d: Process stabilization | |||
*1.5d: Cloud vs old infrastructure benchmark | |||
*.5d: Unavailability time calibration per migration unit | |||
* | |||
Total = 15 man-days. | |||
== WEB Enhancement == | == WEB Enhancement == | ||
Line 123: | Line 136: | ||
* Opt for fast HTTP cache like VARNISH and Apache Traffic Server for large files. | * Opt for fast HTTP cache like VARNISH and Apache Traffic Server for large files. | ||
* Use PROXY with TLS decoder like ENVOY for service compatibility. | * Use PROXY with TLS decoder like ENVOY for service compatibility. | ||
* Consider serverless | * Consider serverless services for standard runtimes, mindful of potential incompatibilities. | ||
* Employ load balancing or native services for dynamic computing power. | * Employ load balancing or native services for dynamic computing power. | ||
* Use open source STACKs where possible. | * Use open-source STACKs where possible. | ||
* Employ database caches like MEMCACHED. | * Employ database caches like MEMCACHED. | ||
* Use queues for long | * Use queues for long batches. | ||
* Use buffers for stability of real streams. | * Use buffers for stability of real streams. | ||
* More information at [https://wikitech.wikimedia.org/wiki/Wikimedia_infrastructure CLOUD WIKIPEDIA] and [https://github.com/systemdesign42/system-design GITHUB]. | * More information at [https://wikitech.wikimedia.org/wiki/Wikimedia_infrastructure CLOUD WIKIPEDIA] and [https://github.com/systemdesign42/system-design GITHUB]. | ||
Line 221: | Line 234: | ||
== CLOUD providers == | == CLOUD providers == | ||
* [https://cloud.google.com/free/docs/aws-azure-gcp-service-comparison CLOUD providers] | * [https://cloud.google.com/free/docs/aws-azure-gcp-service-comparison CLOUD providers] | ||
== CLOUD INTERNET NETWORK == | == CLOUD INTERNET NETWORK == | ||
* [https://global-internet-map-2021.telegeography.com/ CLOUD INTERNET NETWORK] | * [https://global-internet-map-2021.telegeography.com/ CLOUD INTERNET NETWORK] | ||
== CLOUD NATIVE == | == CLOUD NATIVE == | ||
* [https://landscape.cncf.io/?fullscreen=yes OFFICIAL STACKS] | * [https://landscape.cncf.io/?fullscreen=yes OFFICIAL STACKS] | ||
Line 233: | Line 248: | ||
=== Typical Architecture === | === Typical Architecture === | ||
*Dual-room. | * Dual-room. | ||
*IPMI LAN (fencing). | * IPMI LAN (fencing). | ||
*NTP, DNS+DHCP+PXE+TFTP+HTTP (auto-provisioning), PROXY (updates or internal REPOSITORY). | * NTP, DNS+DHCP+PXE+TFTP+HTTP (auto-provisioning), PROXY (updates or internal REPOSITORY). | ||
*Choose 2+ node clusters. | * Choose 2+ node clusters. | ||
*For 2-node, require COROSYNC 2-node config, 10-second staggered closing for stability. | * For 2-node, require COROSYNC 2-node config, 10-second staggered closing for stability. For better stability, choose 3+ nodes architecture. | ||
*Allocate 4GB/base for DB resources. CPU resource requirements generally low. | * Allocate 4GB/base for DB resources. CPU resource requirements are generally low. | ||
=== Typical Service Pattern === | === Typical Service Pattern === | ||
*Multipath | * Multipath | ||
*LUN | * LUN | ||
*LVM (LVM resource) | * LVM (LVM resource) | ||
*FS (FS resource) | * FS (FS resource) | ||
*NFS (FS resource) | * NFS (FS resource) | ||
*User | * User | ||
*IP (IP resource) | * IP (IP resource) | ||
*DNS name | * DNS name | ||
*Process (Process resource) | * Process (Process resource) | ||
*Listener (Listener resource) | * Listener (Listener resource) | ||
== HPC == | == HPC == | ||
[[File:HPC.drawio.png]] | [[File:HPC.drawio.png]] | ||
== IT | |||
*[http://jobsearchtech.about.com/od/educationfortechcareers/tp/HighestCerts.htm Best IT certifications] | == IT Wage == | ||
*[https://www.silkhom.com/barometre-2021-des-tjm-dans-informatique-digital FREELANCE] | * [http://jobsearchtech.about.com/od/educationfortechcareers/tp/HighestCerts.htm Best IT certifications] | ||
*[http://www.journaldunet.com/solutions/emploi-rh/salaire-dans-l-informatique-hays IT] | * [https://www.silkhom.com/barometre-2021-des-tjm-dans-informatique-digital FREELANCE] | ||
* [http://www.journaldunet.com/solutions/emploi-rh/salaire-dans-l-informatique-hays IT] | |||
== SRE == | == SRE == | ||
* [https://openapm.io SRE] | * [https://openapm.io SRE] | ||
== REDHAT | |||
* [https://access.redhat.com/downloads/content/package-browser REDHAT | == REDHAT Package Browser == | ||
* [https://access.redhat.com/downloads/content/package-browser REDHAT Package Browser] |
Latest revision as of 14:48, 23 November 2024
Discover cloud computing on infocepo.com:
- Master cloud infrastructure
- Explore AI
- Compare Kubernetes and AWS
- Advance your IT skills with hands-on labs and open-source software.
Start your journey to expertise.
AI Tools
- ChatGPT4 - Public assistant with learning abilities.
- open-webui + GPU H100 + Ollama - Private assistant and API.
- Private summary
DEV
(28/08/2024)
- LLM Trending
- Project Trending
- LLM Ranking
- ChatBot Evaluate
- Perplexity AI - R&D
- Models Trending
- LLM Fine Tuning
- Embeddings Ranking
- Vectors DB Ranking
- NVIDIA H100 - KUBERNETES or HPC clusters for DATASCIENCE.
- NVIDIA 4080 - GPU card for private assistance.
- Img2txt Trending
- Txt2img Evaluate
- Chatchat - Private assistant with RAG capabilities in Chinese.
- HPC Efficiency
INTERESTING LLMs
(23/11/2024)
Model | Comment |
---|---|
parse | gemma2-simpo |
RAG | gemma2-simpo |
RAG-FR | qwen2.5 |
code | gemma2-27b, $$ |
code-completion | deepseek-coder:base |
summary | qwen2.5 |
ai-translate | gemma2, temperature 0 |
chat-leger | 0.000055 euros/token, gemma2-simpo |
chat-lourd | 0.00015 euros/token, gemma2-27b, $$ |
mannix/gemma2-9b-simpo | OllamaFunctions |
NEWS
(04/05/2024)
- Very good AI News
- For the transcription in real time with Diart, it is possible to follow the interlocutors.
- Translation tools like Google Translate are becoming popular.
- LLM 10x accelerator and cheaper with GROQ.
- Opensearch with LLM
TRAINING
CLOUD LAB
Presenting my LAB project.
CLOUD Audit
Created ServerDiff.sh for server audits. Enables configuration drift tracking and environment consistency checks.
CLOUD Migration Example
- 1.5d: Infrastructure audit of 82 services (ServerDiff.sh)
- 1.5d: Create cloud architecture diagram.
- 1.5d: Compliance check of 2 clouds (6 hypervisors, 6TB memory).
- 1d: Cloud installations.
- 0.5d: Stability check.
ACTION | RESULT | OK/KO |
Activate maintenance for n/2-1 nodes or 1 node if 2 nodes. | All resources are started. | |
Un-maintenance all nodes. Power off n/2-1 nodes or 1 node if 2 nodes, different from the previous test. | All resources are started. | |
Power off all nodes simultaneously. Power on all nodes simultaneously. | All resources are started. |
- 1.5d: Cloud automation study.
- 1.5d: Develop 6 templates (2 clouds, 2 OS, 8 environments, 2 versions).
- 1d: Create migration diagram.
- 1.5d: Write 138 lines of migration code (MigrationApp.sh).
- 1.5d: Process stabilization.
- 1.5d: Cloud vs. old infrastructure benchmark.
- 0.5d: Unavailability time calibration per migration unit.
- 5 min: Load 82 VMs (env, OS, application code, 2 IPs).
Total = 15 man-days.
WEB Enhancement
- Formalize infrastructure for flexibility and reduced complexity.
- Utilize customer-location tracking name server like GDNS.
- Use minimal instances with a network load balancer like LVS.
- Compare prices of dynamic computing services, beware of tech lock-in.
- Employ efficient frontend TLS decoder like HAPROXY.
- Opt for fast HTTP cache like VARNISH and Apache Traffic Server for large files.
- Use PROXY with TLS decoder like ENVOY for service compatibility.
- Consider serverless services for standard runtimes, mindful of potential incompatibilities.
- Employ load balancing or native services for dynamic computing power.
- Use open-source STACKs where possible.
- Employ database caches like MEMCACHED.
- Use queues for long batches.
- Use buffers for stability of real streams.
- More information at CLOUD WIKIPEDIA and GITHUB.
CLOUD WIKIPEDIA
CLOUD vs HW
Function | Kubernetes | OpenStack | AWS | Bare-metal | HPC | CRM | oVirt |
---|---|---|---|---|---|---|---|
Deployment Tools (Tools used for deployment) |
Helm, YAML, Operator, Ansible, Juju, ArgoCD | Ansible, Packer, Terraform, Juju | Ansible, Terraform, CloudFormation, Juju | Ansible, Shell Scripts | xCAT, Clush | Ansible, Shell Scripts | Ansible, Python, Shell Scripts |
Bootstrap Method (Initial configuration and setup) |
API | API, PXE | API | PXE, IPMI | PXE, IPMI | PXE, IPMI | PXE, API |
Router Control (Routing services) |
API (Kube-router) | API (Router/Subnet) | API (Route Table/Subnet) | Linux, OVS, External Hardware | xCAT, External Hardware | Linux, External Hardware | API |
Firewall Control (Firewall rules and policies) |
Ingress, Egress, Istio, NetworkPolicy | API (Security Groups) | API (Security Group) | Linux Firewall | Linux Firewall | Linux Firewall | API |
Network Virtualization (VLAN/VxLAN technologies) |
Multiple Options | VPC | VPC | OVS, Linux, External Hardware | xCAT, External Hardware | Linux, External Hardware | API |
Name Server Control (DNS services) |
CoreDNS | DNS-Nameserver | Amazon Route 53 | GDNS | xCAT | Linux, External Hardware | API, External Hardware |
Load Balancer (Load balancing options) |
Kube-proxy, LVS (IPVS) | LVS | Network Load Balancer | LVS | SLURM | Ldirectord | N/A |
Storage Options (Available storage technologies) |
Multiple Options | Swift, Cinder, Nova | S3, EFS, FSx, EBS | Swift, XFS, EXT4, RAID10 | GPFS | SAN | NFS, SAN |
CLOUD providers
CLOUD INTERNET NETWORK
CLOUD NATIVE
- OFFICIAL STACKS
- DevSecOps :
High Availability (HA) with Corosync+Pacemaker
Typical Architecture
- Dual-room.
- IPMI LAN (fencing).
- NTP, DNS+DHCP+PXE+TFTP+HTTP (auto-provisioning), PROXY (updates or internal REPOSITORY).
- Choose 2+ node clusters.
- For 2-node, require COROSYNC 2-node config, 10-second staggered closing for stability. For better stability, choose 3+ nodes architecture.
- Allocate 4GB/base for DB resources. CPU resource requirements are generally low.
Typical Service Pattern
- Multipath
- LUN
- LVM (LVM resource)
- FS (FS resource)
- NFS (FS resource)
- User
- IP (IP resource)
- DNS name
- Process (Process resource)
- Listener (Listener resource)