請更新您的瀏覽器

您使用的瀏覽器版本較舊,已不再受支援。建議您更新瀏覽器版本,以獲得最佳使用體驗。

Eng

APTO Releases High-Accuracy Japanese Reasoning Data for LLM Fine-Tuning, Free of Charge

PR Newswire (美通社)

更新於 06月17日17:59 • 發布於 06月17日17:00 • PR Newswire

TOKYO, June 18, 2025 /PRNewswire/ -- APTO is pleased to announce the release of a free dataset for fine-tuning reasoning models, such as OpenAI's GPT-01 and Deepseek's Deepseek R1.

This dataset can help to improve reasoning ability in Japanese and reduce redundant inference.

This allows for faster inference even with limited token counts and memory usage.

Dataset Details

Each data entry includes a question that requires reasoning and its corresponding answer, with the thought process described within'think' XML tags.

This dataset consists of high-quality data generated by our proprietary technology and manually reviewed for accuracy.

Validation using models such as Qwen3 has confirmed that training with this dataset improves reasoning ability in Japanese and enables more efficient inference.

Additionally, testing with the Japanese MT-Bench showed performance improvements particularly in categories such as reasoning, math, and coding.

Figure 1: An example of JSON format in the free public dataset.

Figure 1: An example of JSON format in the free public dataset.

Tag Information

Each question-and-answer conversation is labeled with tag information indicating the subject matter and genre of the conversation.

The following labels are used:

People

Human Relations

Social Studies

Business

Economics

Politics

Law

Technology

Religion

Astronomy

Meteorology

Fashion

Programming

Manufacturing

Daily life

Mathematics

Health

Medicine

Education

Biology

Japanese

Physics

Chemistry

Geography

Science

History

Linguistics

Literature

Performing Arts

Art

Music

Transport

Food

Recipes

Leisure

Games

Sports

Industry

Performance Evaluation Results of the Data

With the Qwen3 model, the thought process enclosed in 'think' tags often became lengthy depending on the task—particularly in multi-turn conversations.

In fact, for math and reasoning tasks in the Japanese MT-Bench, there were many cases where the model engaged in extremely long trial-and-error thinking and failed to reach a conclusion.

In environments with limited token availability, tests showed that avoiding reasoning sometimes yielded higher scores.

However, by fine-tuning with our reasoning dataset, the model was able to reason in Japanese while also suppressing redundant inference, resulting in faster inference even with token count and memory usage constraints.

Figure 2 are the evaluation results from Japanese MT-Bench under a restricted maximum token output setting *¹

( *¹ All results were generated using 4-bit quantization, with a maximum output of 4,096 tokens.)

Figure 2: Japanese MT-Bench under a restricted maximum token output setting.

Figure 2: Japanese MT-Bench under a restricted maximum token output setting.

The 'Baseline (Qwen3)' refers to the score of the standard Qwen3 model with reasoning enabled as an option.

'+FineTuning' indicates the score after fine-tuning using 100 samples from the included dataset, combined with synthetically generated data created under the same conditions.

In the Japanese MT-Bench, there are 10 questions for each of the 8 categories shown under 'Category.'

The answers were automatically evaluated using OpenAI's GPT-4.1 model API, with scores given on a 10-point scale. The table shows the average of these scores. *² *³

(*² Additionally, during evaluation by GPT-4.1, a Chain-of-Thought (CoT) process prompting the model to explain its reasoning was added for validation.)

(*³ Since output variability occurs during generation, the scores represent the average of four repeated runs of the same benchmark test.)

The 'Total' score represents the average of the scores across all eight categories.

As noted above, improvements were observed across all levels, including categories involving reasoning.

This suggests that the model is now able to generate appropriate responses even with a limited number of tokens, effectively enhancing its performance in Japanese.

This dataset is also publicly available on Hugging Face at the following link:

For our existing clients, it will also be shared soon through our email newsletter. We hope it helps accelerate your AI development and enhance accuracy. Feel free to make full use of it!

About APTO, Inc.

APTO provides AI development support services focused on data, the most critical factor influencing accuracy in AI development.

Our offerings include:

  • harBest, a data collection and annotation platform utilizing crowd workers

  • harBest Dataset, which accelerates the preparation of data, a common bottleneck in early development stages

  • harBest Expert, which enhances data quality using the knowledge of field experts.

By supporting AI development projects that face data-related challenges, we have earned the trust of many enterprise clients both in Japan and abroad.

We provide support for AI data, model development, GPU resources, and a variety of other needs. If you're facing challenges in AI development, please feel free to reach out to us.

CONTACT: Katina Nguyen,

查看原始文章

Xinhua Silk Road: Coffee industry-related trade co-op highlighted at side event of China-Africa expo

PR Newswire (美通社)

Discover the Ultimate Family Getaway at Premier Residences Phu Quoc Emerald Bay

PR Newswire (美通社)

How Solar Storage is Shaping the Future of Clean Energy After a Decade of Transformation

PR Newswire (美通社)
查看更多
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...

最新內容

Xinhua News | Hamas, PIJ say peace talks must ensure Israeli army withdrawal

XINHUA

Imugene Announces Outstanding Response Rates from the Phase 1b Trial of the Azer-cel Allogeneic CAR T in 3L+ DLBCL

PR Newswire (美通社)

Discover Opportunities to Buy International Homes at the Global Property Expo, Singapore

PR Newswire (美通社)

Agoda: Surabaya, Indonesia, is the Cheapest Destination in Asia This Summer

PR Newswire (美通社)

Digital Domain Supports with DMD Teen 'Tszkin' by Creating a Personalized AI Virtual Human to Help Achieve His Dreams

PR Newswire (美通社)

Agoda Reveals Iloilo Ranked Fifth Most Affordable Summer Destination in Asia

PR Newswire (美通社)

Cainiao Expands APAC Supply Chain, Now Operating Warehousing and Fulfillment in 10 Markets

PR Newswire (美通社)

Palmer shines as Chelsea crushes PSG to win FIFA Club World Cup (updated)

XINHUA

Chelsea coach Maresca "no words for the players" after winning Club world cup

XINHUA

Driving Global Manufacturing Excellence | Topband Vietnam Facility Sets New Benchmark

PR Newswire (美通社)

China invites journalists from home, abroad to cover victory anniversary events in Beijing

XINHUA

China takes recurve women's team silver at Archery World Cup

XINHUA

Sinner beats Alcaraz to win maiden Wimbledon title

XINHUA

China's foreign trade up 2.9 pct in H1

XINHUA

Mounds claim their rightful place in history

PR Newswire (美通社)

Xinhua News | China invites journalists from home, abroad to cover victory anniversary events in Beijing

XINHUA

Departing from Nagoya, Easily Explore the Highlights of Central Japan - Leverage Meitetsu for an In-depth Journey Through the Tokai Region

PR Newswire (美通社)

Culture Meets Craft: Chow Tai Fook’s Timeless Harmony high jewellery bedazzles in Hangzhou

Tatler Hong Kong

METABORA Partners with LINE NEXT to Distribute Web3 Games via Mini Dapp

PR Newswire (美通社)

Global Times: Xixia Imperial Tombs: cultural fusion of diverse traditions behind World Heritage Site status

PR Newswire (美通社)

ECRL mega rail project marks another milestone with breakthrough of Genting Tunnel

XINHUA

Afghanistan-Pakistan trade grows to nearly 1 bln USD in H1

XINHUA

Malaysia leads Southeast Asia IPO performance in first half of year

XINHUA

Hong Kong's financial ties with ROK strengthened amid enhanced regional connectivity

XINHUA

GLOBALink | Brazilian cardiologist upskills in China to benefit patients back home

XINHUA

Daily World Briefing, July 14

XINHUA

CARsgen Successfully Defends Its GPC3 CAR-T Patent at the EPO

PR Newswire (美通社)

German finance minister urges EU to push back if tariff talks with U.S. fail

XINHUA

LinqAlpha Partners with Microsoft via the Majung Program to Build Secure Cloud Native Financial AI Infrastructure

PR Newswire (美通社)

One month after Israeli surprise attack, Iranians stay vigilant

XINHUA

PKK disarmament opens "new page in history" for Türkiye: Erdogan

XINHUA

46 killed by Israeli attacks across Gaza: civil defense

XINHUA

Roundup: Title favorites off to winning start at FIBA Women's Asia Cup

XINHUA

Chinese FM meets Russian counterpart on SCO cooperation

XINHUA

China routs Indonesia in FIBA Women's Asia Cup

XINHUA

InPics | Tianjin's "4-in-1" urban renewal program opens to public

XINHUA

Eritrea's Mulueberhan, XDS Astana win Tour of Magnificent Qinghai

XINHUA

Songshan Lake: a microcosm of China's innovation ecosystem

XINHUA

Taiwan youth carve out their futures on mainland

XINHUA

Interview: Global Civilization Initiative significant for world peace, says veteran media professional

XINHUA