Dialogue with AI trainer from Fudan University: Is MOSS derived from "Wandering Earth 2"? What are its future goals? Model | Training | Wandering Earth 2

Release time:Apr 14, 2024 11:59 AM

Dialogue character: He Zhengfu, artificial intelligence trainer for the MOSS project at the Natural Language Processing Laboratory of Fudan University

Q: What is MOSS and what are its main functions?

Answer: MOSS is a conversational language model that can provide various direct or indirect assistance to people's lives. It can conduct Q&A on daily life knowledge, help check weather, plan travel, etc; It can assist in efficient office work, such as automatically processing tables, generating outlines, drafts, translations, etc., and also master professional knowledge in fields such as finance, healthcare, and education. Many industries are introducing conversational language models represented by MOSS, such as automotive voice assistants, customer service, etc., which will have the effect of cost reduction and efficiency improvement.

Q: What is your specific training process for MOSS?

Answer: The essence of parameters in a large model is a massive matrix, which performs simple, heavy, and repetitive numerical operations on the input text to ultimately obtain the content that needs to be generated. We can collect and "clean" the corpus on the internet, and allow large models to learn knowledge from these corpora. Specifically, the learning process involves constantly "reading" the text and adjusting the internal parameters of the large model to deepen its understanding of language, ultimately obtaining some kind of "intelligence". This process is called training.

In the process of building MOSS, we empower it with powerful capabilities through a three-stage "reading" process. The first stage is the acquisition of basic knowledge. MOSS extensively "reads" almost all texts on the network, and due to its large number of parameters, it is sufficient to cover a vast amount of knowledge. The second stage is the acquisition of dialogue ability. MOSS learns to answer human questions through dialogue by reading dialogue data and utilizing the knowledge acquired in the first stage. The third stage is alignment. Due to the possibility of misleading responses, MOSS will suppress the generation of content that does not comply with human laws and moral ethics based on human feedback, making the answers more objective and rational.

Q: What are the differences between MOSS and ChatGPT?

Answer: ChatGPT's training data covers a wide range and provides a good user experience. As an attempt in the academic community, MOSS aims to share more forward-looking theories and engineering experiences with the academic community by creating an open-source conversational language model.

Q: Is MOSS derived from the movie "Wandering Earth 2"? What are its future goals?

Answer: The name MOSS is related to the movie "Wandering Earth 2", in which the artificial intelligence robot MOSS exhibits extremely strong intelligence and rationality, becoming a powerful assistant to humans. We have seen the enormous potential of artificial intelligence in the development of conversational language models, so we named MOSS, which embodies our expectations for the future development of artificial intelligence technology.

The future MOSS will become increasingly intelligent. We will fully utilize the cloud computing power and resources provided by platforms such as volcanic engines, conduct model iterations and technical exchanges with more peers, continuously explore the technological frontiers of conversational and large-scale language models, and enable artificial intelligence technology to better benefit human society.

Artificial Intelligence Trainer: Making Machines Understand Humans More

Turn on the computer, input the collected sound data such as wind, rain, and stream sound, "clean" the mixed noise, and "train" the hearing aid data model to test its sensitivity in real scenes... Accompanied by the "clattering clattering clattering" sound of fingers tapping the keyboard, Tencent Teana Lab's artificial intelligence trainer Fu Cong's day of work begins.

In recent years, with the continuous development of artificial intelligence technology, this profession known as artificial intelligence trainers has gradually grown. As one of the "digital professions", the emergence of artificial intelligence trainers has accelerated the process of artificial intelligence from technological research and development to industry application, which will generate high economic and social value.

Continuously feeding data to the model

Every time he goes out, Fu Cong always wears a big "earring" on his ear.

This "earring" is actually a test version of a hearing aid. The sounds in earrings come in various forms, including whistling noise, sharp and piercing noise... These noises, amplified by hearing aids, have been a long-standing problem for many hearing-impaired individuals wearing hearing aids.

Fu Cong and his team are trying to use algorithm design and artificial intelligence technology to "train" data models, making hearing aids more "intelligent" in reducing noise, making hearing-impaired people hear clearly, understand, and feel comfortable.

Fu Cong explained that the data model of hearing aids is very small, so it needs to be optimized for different scenarios. Many scenarios are full of challenges, such as a hearing-impaired person eating in a restaurant, surrounded by many people talking, and wanting to chat with the opposite person. The sound around is particularly noisy, and as a normal person, one may not be able to hear clearly, let alone a person with hearing impairment. We hope to use the model to extract the necessary sound, reduce noise, and help more hearing-impaired people.

The ideal is full, but the actual process of developing model algorithms is like a repeated "battle".

The development process of the model can be roughly divided into the following steps: data collection, data cleaning, model training, scenario testing, and algorithm adjustment. After several iterations, testing and adjustment are carried out. If the test results are not ideal, this process needs to be repeated until the optimal effect is achieved. Fu Cong said.

Data collection should be targeted. In order to make the model smarter, it is necessary to collect various special data for different scenarios. Fu Cong and his team members not only need to go to the subway during rush hour, bustling restaurants, and crowded roads to collect hundreds of hours of sound data, but also need to wear hearing aids to experience the differences in these sounds. "For example, the wind may sound like a whirring sound to a normal person, but after wearing the hearing aids, it is very noisy, like going to a KTV to sing, and the sound hits the microphone hard.". To collect various wind noise data, Fu Cong recorded wind sounds in various scenes such as road cycling and seaside storms.

Data cleaning is the process of removing unwanted data. Fu Cong gave an example - the sound of the wind, which can be mixed with the sound of cars honking and people talking in real scenes. When organizing, these data should be excluded and kept as a relatively pure source of wind, so that the model can "recognize" the wind.

Model training is the process of feeding cleaned data to the model. In addition to the special data collected, Fu Cong and his colleagues will also include data such as languages from various countries around the world and some non speech sounds, which basically covers all the noise and speech encountered in people's lives.

Unlike humans, artificial intelligence models do not get tired, irritable, or lose their temper during the training process. Their intelligence depends on model parameters, training strategies, data volume, and so on. "They are like a 'child', becoming increasingly 'smart' and recognizing more and more sounds, which gives me a great sense of achievement," said Fu Cong.

Test patience, meticulousness, and endurance

After the model training is completed, it does not mean that it can be immediately applied to hearing aids for hearing-impaired people, and it also needs to go through a long process of iteration and adjustment.

For example, in order to provide suitable hearing aids for hearing-impaired individuals, the traditional approach is for patients to repeatedly go to offline fitting stores to try them on, which is a complicated process. Fu Cong explained that in general, hearing loss can be classified into three types based on the cause of the disease: sensorineural, conductive, and mixed hearing loss; According to the degree of hearing loss, it can be divided into mild, moderate, severe, and extremely severe hearing loss. The adaptation methods of hearing aids vary for different types.

Is it possible to move the adaptation process online and utilize artificial intelligence algorithms and deep learning capabilities to enable hearing-impaired individuals to perform accurate listening tests online? With this question in mind, Fu Cong began to develop adaptation algorithms. He likened this process to doing application problems, which require searching domestic and foreign literature, searching for existing solutions, using existing knowledge to carry out reasonable imagination, design experiments, and find answers based on specific usage environments.

This process tests the patience and meticulousness of artificial intelligence trainers. When testing the sound quality of hearing aids, different wearing methods correspond to different test results. Fu Cong and colleagues need to design different wearing methods in an "N x N" arrangement and repeatedly conduct experiments to study their impact on sound quality.

This process greatly tests the endurance of artificial intelligence trainers. "The basic literacy of an artificial intelligence trainer is to force oneself to listen to harsh sounds many times." Fu Cong said that this is because the trainer needs to quantitatively measure the sound limit points that hearing-impaired patients can hear normally, and the decibels of these sounds are unbearable for normal human ears. "Many times, I wish I could drop my headphones. After a day of testing, my whole head feels pain.".

After continuous iteration and adjustment, the built-in algorithm hearing aid has finally been completed. The most unforgettable thing for Fu Cong was their first time donating products to Shaoguan, Guangdong. They handed hearing aids to the deaf elderly one by one, turned them on, put on the equipment, adjusted the gain... "Although I had great confidence in the model, I still felt my heart in my throat because before that, the elderly could not communicate normally," Fu Cong said.

He cautiously asked an old man, "Can you hear what I'm saying?"

"Okay," the old man said three words slowly and firmly from his mouth.

"At that time, I felt that what we were doing was quite meaningful," said Fu Cong.

Using technology to solve human needs

Artificial intelligence trainers are a profession that needs to endure loneliness, as they spend a lot of time designing solutions, writing code, collecting data, and training models.

"My secret to overcoming loneliness is interest." Fu Cong's major is communication, many of which are related to signal processing. He usually likes music, so he combines his interests with his major and work, focusing on the audio field. After graduating from university, he participated in many works related to audio signal processing, experiencing various stages of audio algorithms from traditional algorithms to artificial intelligence algorithms, and then to large-scale deep learning.

In Fu Cong's view, artificial intelligence technology is a great tool aimed at liberating humans from a lot of mental labor and replacing currently costly individual labor in a large-scale manner. For the entire society, this is a progress in productivity with enormous social and commercial value.

What is a mature artificial intelligence technology like? Fu Cong believes that there are three stages to go through: first, perceptual intelligence, which focuses on simulating human visual, auditory, and tactile perceptual abilities, such as facial recognition, speech recognition, etc; The second is cognitive intelligence, which has characteristics such as human thinking comprehension, knowledge sharing, action coordination, or game theory. It can truly understand what people are saying and provide relatively complete answers based on some prompts; The third is behavioral intelligence, which, like autonomous driving, can truly play a role in the physical world.

To achieve this goal, it is necessary to continuously train the artificial intelligence model. Fu Cong stated that the first step is to prepare enough data for the problem, "as much as possible to cover all the situations encountered in solving this problem"; Secondly, it is necessary to design good algorithms and continuously optimize them based on user feedback.

"The field of artificial intelligence technology is advancing rapidly, requiring AI trainers to have broad perspectives, profound humanistic sentiments, and a sense of social responsibility. They should use the latest ideas, concepts, and correct ethics in the industry to help humans solve problems encountered in production and life." Fu Cong said.

Two women were stabbed to death and reported to have committed a crime 4 days before the follow-up visit for schizophrenia. Suspect of a bloody murder case in a Hong Kong shopping mall appeared in court today. Male | Last Friday | Murder case

According to Hong Kong's Wen Wei Po, a bloody knife stabbing case occurred at Hollywood Square in Diamond Hill last Friday. The police arrested a 39 year old man on suspicion of stabbing two young women, one of whom was stabbed over 30 times. The suspect appeared in the Kwun Tong Magistrates Court this morning. The police at the Kwun Tong Magistrate's Court temporarily charged the suspect with two counts of murder last Sunday. The suspect appeared in court this morning at the Kwun Tong Magistrate's Court. Acting Chief Magistrate Zheng Jihang, after listening to the opinions of both the prosecution and defense, decided to postpone the hearing for two weeks until 9:30 am on June 19th, waiting for two psychiatric expert reports to be obtained. The defense did not object. Zheng Jihang approved the application, and the defendant needs to be temporarily detained at Xiaolan Mental Hospital. When the suspect appeared in court, he wore black framed glasses, a light gray shirt, and camouflage green shorts, and was able to answer the judge's questions normally. accordingly

Currently, the highly anticipated summer harvest work in Henan has shifted its focus to the northern region of Henan. According to the Henan Daily client, on June 4th, Lou Yangsheng, Secretary of the Henan Provincial Party Committee, presided over a special video scheduling meeting on the "Three Summers" work in the province, listened to the situation report, analyzed and judged the situation, and arranged and deployed the next steps of work. Governor Wang Kai made specific arrangements. On the evening of May 31, 2023, in Xiafutou Village, Xuliang Town, Boai County, Jiaozuo, Henan Province, villagers braved light rain in the wheat fields to harvest wheat. Visual China Map Lou Yangsheng pointed out that the current summer harvest battle in the province has entered the decisive stage. Doing a good job in summer harvest in northern Henan Province is related to the summer grain yield and seed safety. We should focus on seizing opportunities and make every effort to organize the wheat harvesting work in the northern Henan region, minimize losses, and protect the interests of farmers to the greatest extent possible. Accurate forecasting is essential

Xinhua All Media+| Welcome home! What innovative technologies are protecting the return journey of Shenzhou 15? Spaceship | Shenzhou | Technology

On June 4th, the return capsule of the Shenzhou-15 manned spacecraft successfully landed at the Dongfeng landing site. Astronauts Fei Junlong, Deng Qingming, and Zhang Lu all safely and smoothly exited the spacecraft, and the Shenzhou-15 manned flight mission was a complete success. What innovative technologies are there to safeguard the return journey of Shenzhou 15 in this mission? On June 4th, the return capsule of the Shenzhou-15 manned spacecraft successfully landed at the Dongfeng landing site. Xinhua News Agency reporter Lian Zhen photographed that "the sky and the ground" ensure the high-precision return of spacecraft. For the Shenzhou series spacecraft, the return and re-entry GNC technology is directly related to the life safety of astronauts. Taking the success of this return mission as a symbol, China has comprehensively upgraded its GNC system since the Shenzhou-12 manned spacecraft, which features autonomous rapid rendezvous and docking, autonomous adaptive prediction and re-entry return guidance, and has completed a comprehensive update and replacement

The Chinese naval fleet has arrived! Assembly | Navy | Chinese Fleet

At noon today, a Chinese naval fleet consisting of Zhanjiang and Xuchang ships arrived at the assembly area of the "Comodo-2023" multinational maritime joint exercise. It is understood that the assembly anchorage for this exercise is 3 nautical miles long and 1.5 nautical miles wide, capable of anchoring up to 50 ships. Naval vessels from various countries participating in the exercise will also arrive at the anchorage today to complete the assembly of the "Komodo 2023" multinational maritime joint exercise, which is held every two years by the Indonesian Navy. This year is already the fourth edition of the exercise. The exercise will be held from June 5th to 8th in the city of Jakarta, South Sulawesi Province, Indonesia, including the port and sea phases. In the coming days, participating navies from various countries will participate in ship reading style search and rescue exercises, maritime interception and damage management exercises, aerial exercises, and other course objectives exercises

New comment: Donkey like "morale" limit pulls US debt "bomb" fuse hard to dismantle US | debt | morale

On the evening of June 1st, the US Senate passed a bill on the federal government's debt ceiling and budget, and the flame of the US debt bomb was temporarily extinguished at the last moment. The two parties in the United States have staged an extreme tug of war over the US debt bomb. Some experts believe that the US debt crisis is the result of the reckless politics promoted by the US dollar hegemony, and the underlying cause of this crisis is the highly polarized political system of the US. Since the end of World War II, the US Congress has adjusted the debt ceiling more than a hundred times. The recurring debt crisis will not only have a catastrophic impact on the US economy and people's livelihoods, but also continuously erode the value of US dollar assets such as government credit and US bonds, bringing significant and far-reaching impacts to the global economic landscape. 【

Important news

The boundless scenery is always new-General Secretary Xi Jinping guides the construction of digital society, review and development | education | society Unswervingly Promote and Improve the Comprehensive and Strict Party Governance System Xi Jinping | Strict Party Governance | System "Seeking Truth" magazine published an important article by General Secretary Xi Jinping "Improving the Comprehensive and Strict Party Governance System and Promoting the New Great Project of Party Building in the New Era to Develop in Depth" State | President | Xi Jinping General Secretary's thoughts are still in my mind, learning from the season | building a modern marine ranching power | developing | the ocean Leading the way with a heavy load | More vitality in opening up to the outside world, stronger momentum in science and technology innovation - Strong development momentum in the ancient capital of Xi'an Automotive | New energy | Opening up to the outside world Chinese Stars | 210 Seconds Review of the Victory of Aerospace Heroes on their Way Home! Flying | Divine Boat | Hero

Political situation

What are the remaining obstacles in the investigation and undercover investigation by the law enforcement inspection team of the Municipal People's Congress? This work evaluates that Shanghai has always been at the forefront of garbage classification | garbage | law enforcement Looking forward to the global high-quality engineering enterprise layout in Shanghai, Mayor Gong Zheng meets with the President elect of the World Federation of Engineering Organizations | Global | Quality To jointly discuss and deepen cooperation and exchange in counterpart support, Chen Jining and Nie Zhuang led a discussion on the work of the party and government delegation in Kashgar, Xinjiang Chen Jining and Gong Zheng jointly inspected and discussed, jointly serving the overall development of the country! Yin Li and Yin Yong led a Beijing delegation to Shanghai to inspect the delegation's development and overall situation Contributed about 1/4 of the GDP of Shanghai, currently with 70000 foreign-funded enterprises in Shanghai. The Youth Association of Shanghai Foreign Countries has helped each other in terms of membership and GDP The representative group of Yangpu Street helps to alleviate public concerns, with garbage belts turning back into green belts and street lights lighting up again in residential areas | Representative | Yangpu

Economics

What else can I rely on for growing vegetables?, Agricultural Experiment of a Group of Young People: Lettuce without Heaven or Earth | Physical Strength | Agriculture Very few have been rejected at once... Shanghai Automotive Engine Factory has presented 49 automotive grade chips to solicit domestic alternatives to SAIC | Netherlands | United States | Lingang Pujiang International Science and Technology City | Substitution | Import | Localization | Automotive grade chips Is there still a "flower head" in domestic blockchain?, No Bitcoin, First Blockchain Technology National Standard Released Web3.0 | Blockchain | Domestic These top domestic and international beverage cold drink brands have rolled up like this, in order to welcome the overall recovery of the consumer market as a brand | Sugar Tea | International Making a dojo inside a snail shell allows the city's scraps to make a comeback. This European style garden in Shanghai used to be a "stone" on the road. Shanghai Pocket Park | Sparrow Little Dirty | Making a dojo inside a snail shell 23 industry associations (business and academic) in the Yangtze River Delta have established alliances to provide trade adjustment assistance, and the foreign trade situation is complex and severe. | Trade | Alliance

Regional situation

Shanghai Charity Foundation Successfully Issued First Electronic Donation Receipt Community | Activity | Shanghai Charity Foundation Carry out normalized social theater performance activities, and the "hometown of drama" Songjiang Xinbang will resume stage and social theater | Drama | Songjiang Xinbang How to promote and replicate the "near rail model"? Changfeng Xincun Street Building Party Building 4.0 Model | Building Committee | Jintie The largest single scale near zero energy consumption building in Shanghai will be built, and the construction area of the first batch of "Jiabao Smart Bay" in Jiading New City will start | plot | first batch Jing'an creates "one park, one theme", and the art exhibition "Painting Life and Coloring" opens at Jing'an Sculpture Park to experience | Art | Jing'an The highlights of the Xuhui exhibition area at the first Carbon Expo are frequent, including waste plastic recycling and environmental protection roads, Shanghai's first near zero carbon community, low-carbon | green | community

Viewpoint

"White carbon" is on the rise, and there are many colors of carbon? This is definitely not sensationalism, carbon dioxide emissions, color Where does the trend go: "China-Chic" cultural and creative market status, development trend and innovation path Cultural and creative | Chinese goods | China-Chic He demonstrated the essence of the revolutionary ideology and spiritual qualities of communists, and during his brief 40 years of life, Secretary Yu Xiusong's ideology What is the relationship between the "four major functions" and the "five centers"? Accurately grasp these two key elements | functions | relationships Why mention "Clean Your Plate Campaign" again?, Today's life | leftovers | CDs To find the balance between digitization and greening, experts say that excessive digitization may increase energy consumption. Digital | Green | Digitization

Vision

Celebrating the Second Anniversary of the US Army's Withdrawal, [Looking at the World] Afghan Taliban Armed Security Personnel March Typhoon | Taliban | Afghanistan It's more convenient to buy book exhibition tickets offline today, including ID card, cash, and old-fashioned mobile phones. Lu Xun | Book Exhibition | Mobile Phone These spaces create multiple interactive viewing experiences for citizens and tourists, playing with weekend aesthetics | Design | Citizens The opening ceremony will continue as usual. [Looking at the World] The opening match of the Women's World Cup. A shooting occurred in Auckland City, resulting in three deaths. The World Cup | Event | Opening match World Blood Donor Day: Oriental Pearl TV Tower, Shanghai center and other city landmarks light up Life Red World Blood Donor Day Take the artwork home for a thousand yuan? Many people come to this exhibition for their "treasure hunting" works | gallery | to take home