তসিংহু বিশ্ববিদ্যালয় এবং মিয়ানবি বিশ্বের প্রথম এআই-কোডিং প্রি-ট্রেনিং ফ্রেমওয়ার্ক ফর্জট্রেন ওপেন-সোর্স করেছে

ME AI সংবাদ, অনুসরণ করে Beating মনিটরিং, মিয়ানবিজ এবং তসিংহুয়া এনএলপি ল্যাব ওপেনবিএমবি সম্প্রদায়ে বিশ্বের প্রথম সম্পূর্ণরূপে এআই দ্বারা লেখা প্রোডাকশন-লেভেল লার্জ মডেল প্রিট্রেনিং ফ্রেমওয়ার্ক ForgeTrain ওপেন-সোর্স করেছে এবং ForgeTrain দ্বারা প্রশিক্ষিত এন্ড-সাইড ছোট মডেল MiniCPM5-1B প্রকাশ করেছে। 'এআই তৈরি করে এআই' এর ইঞ্জিনিয়ারিং সম্পূর্ণ চক্রের প্রথম উদাহরণ হিসেবে, ForgeTrain একই হার্ডওয়্যার শর্তে নভিডিয়ার Megatron-এর চেয়েও বেশি পারফরম্যান্স দেখিয়েছে এবং হুয়াওয়ে হাইসেঞ্চেন প্রিট্রেনিংয়ে 10% গতি বৃদ্ধি করেছে। একইসাথে, MiniCPM5-1B Artificial Analysis-এর ওপেন-ওয়েটস ছোট মডেল র‍্যাঙ্কিংয়ে প্রথম স্থান অধিকার করেছে। AI-কে নিজেই বেসমেন্ট প্রিট্রেনিং ইনফ্রাস্ট্রাকচার তৈরির জন্য, মিয়ানবিজ 'ফর্জ ইঞ্জিনিয়ারিং' (Forge Engineering) সফটওয়্যার প্রোগ্রামিং প্যারাডাইমটি প্রস্তাব করেছে, যা সব হার্ডওয়্যার এবং টাস্ককে সমর্থনকারী জেনারিক ফ্রেমওয়ার্ককে বাদ দিয়ে AI-এর কমখরচের কোডজেনারেশন ক্ষমতা ব্যবহার করে নির্দিষ্ট মডেল এবং হার্ডওয়্যারের জন্য স্পেসিফিককোডকে 'ফর্জ' (তৈরি) করে। ForgeTrain-এর গঠনগতভাবে, 3টি পর্যায়: প্রথমত, existing pre-training frameworks-এর key data collection-এর matrialization, Harness-এর formation; 2য়ত, auto-closed loop-এ binary-consistent framework code generation; 3য়ত, restriction removal and surpassing reference implementation। 3টি phase-এর automation evolution AI-এর L3 to L4 stage-এর corresponding। ForgeTrain-এর first output model MiniCPM5-1B-এ 1.08 billion parameters, core architecture standard LlamaForCausalLM-এর up on, significantly lowering the barrier for downstream integration and inference deployment। Artificial Analysis evaluation-এ, model 18 score with Qwen3.5-2B (16 score) surpasses 2B scale, and leads Qwen3.5-0.8B (11 score) and LFM2.5-1.2B-Thinking (8 score)। Model supports MLX 4-bit and GGUF Q4_K_M deployment formats; after INT4 quantization, weights only 0.5GB; natively supports 131,072 tokens long context and hybrid dual-mode inference based on enable_thinking। Extremely low hardware overhead, OpenBMB simultaneously open-sourced the purely offline desktop floating companion app MiniCPM Desk Pet, which supports real-time response to coding activities in development tools like Cursor and LoRA persona switching। (Source: BlockBeats)