Date: 2024-02-18
Reposted from: AIcore
Article source: 量子比特 (QbitAI)
Image source: generated by 無界AI
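Which technical team is drawing the most attention in the world right now?
The Sora team has become the center of attention.
The comment sections of the project leads are not only packed, they have become the most popular "tourist attraction" around.
The résumés of its talented members also continue to attract scrutiny.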
△ From Weibo blogger @木雅
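People have noticed that this team is quite young: the two leads only finished their PhDs last year (2023), and the team even includes a member born after 2000...
But it is also genuinely impressive: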
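Tim Brooks is one of the authors of DALL-E 3 and the author of InstructPix2Pix, a project with 5.7k stars on GitHub. He was also the project lead of a video generation study while interning at NVIDIA from 2021 to 2022.
William (Bill) Peebles worked with Xie Saining to develop DiT (Diffusion Transformer), one of the technical foundations of Sora. He also has a paper that was shortlisted as a best paper candidate at CVPR 2022.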
……
Today, let's take a detailed look at where this team comes from.
A team led by fresh graduates
Including Tim and Bill, Sora has three main leads (listed below in no particular order).
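Tim Brooks, who is also an author of DALL-E 3, received his PhD from the University of California, Berkeley only in January of last year.
Tim did his undergraduate studies at Carnegie Mellon University, majoring in logic and computation with a minor in computer science. During that time he interned for four months in Facebook's software engineering department.
After finishing his bachelor's degree in 2017, Tim first worked at Google for nearly two years, working on AI camera features in the Pixel phone division, and then went to the Berkeley AI lab to pursue a PhD.
During his PhD at Berkeley, Tim's main research direction was image and video generation. He also interned at NVIDIA, where he led a video generation study.
After returning to campus, Tim worked with his advisor, Professor Alexei Efros, and postdoc Aleksander Holynski (now at Google) to develop the AI image editing tool InstructPix2Pix, which was selected as a CVPR 2023 Highlight.
In January of last year, Tim completed his PhD and joined OpenAI, where he worked first on DALL-E 3 and then on Sora.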
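It is worth mentioning that Tim is not only highly skilled in his professional field but also a man of many talents.
According to Tim himself, he also enjoys photography and music. Photos he took in high school won awards from National Geographic. He has performed on Broadway and has won international beatboxing awards...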
William Peebles, who attended the same school as Tim and graduated four months after him, is another of Sora's leads.
(Peebles uses the nickname Bill on X and his full name, William, on LinkedIn and when signing papers. He is referred to as Bill below.)
Bill did his undergraduate studies at MIT, majoring in computer science. He took part in research on GANs and text-to-video, and also interned on NVIDIA's deep learning and autonomous driving team, where he worked on computer vision.
After graduating and before formally starting his PhD, he did a summer internship at Adobe, where his research again focused on GANs. The project was a collaboration with the group of Zhu Junyan, a Chinese scholar (then) at Carnegie Mellon University (also a former student of Professor Efros, now at MIT), and it became a best paper candidate at CVPR 2022.
When the semester began, Bill went to Berkeley to pursue his PhD in Professor Efros's research group. His work was accepted multiple times at academic conferences such as SIGGRAPH, ICCV, and CVPR.
In May 2022, Bill went to Meta for a half-year internship and collaborated with Xie Saining (who had not yet left Meta when Bill started his internship) to publish the DiT model, which combined the Transformer with the diffusion model for the first time.
This work was accepted as an Oral paper at ICCV 2023. It is worth noting that Sora, which OpenAI has just released, is believed to be built on DiT.
In May last year, Bill also graduated from Berkeley and joined OpenAI.
Unlike these two researchers, who joined last year, Aditya Ramesh, the third lead of the Sora team, is an OpenAI veteran.
Aditya is the creator of DALL-E and has led the research on all three generations of it. He is a co-author of the papers for all three versions.
And yet this expert, who led three generations of DALL-E and now leads the Sora team, holds only a bachelor's degree.
According to LeCun, Aditya did his undergraduate studies at New York University and took part in some projects in LeCun's lab.
During that time, Aditya was already studying generative models and published a paper with LeCun.
After graduating, Aditya intended to continue his studies, but OpenAI kept him on after a summer internship and he became a full-time researcher.
A member born after 2000 has joined too
Aditya Ramesh is not the only person on the Sora team whose highest degree is a bachelor's.
As mentioned earlier, the team includes Will DePue, who was born after 2000 and graduated from the Department of Computer Science at the University of Michigan only in 2022.
In his senior year of college he founded a market consulting company called Deep Research, which was later acquired by Commsor.
He joined OpenAI in July 2023. According to his LinkedIn profile, he joined the Sora project team only in January of this year.
In addition, neither David Schnurr nor Joe Taylor has a Ph.D. The former graduated from the University of California, Santa Barbara, and the latter graduated from the Art University of San Francisco.
And as Aditya Ramesh himself said, many members of the Sora team are the authors of DALL-E 3.
They include two Chinese researchers, Li Jing and Yufei Guo.
Li Jing is a co-author of DALL-E 3. He graduated from the Department of Physics at Peking University in 2014 and received his PhD in physics from MIT in 2019. After more than two years as a postdoc at Meta, he joined OpenAI in 2022.
Another Chinese author is Ricky Wang, who moved from Meta/Instagram to OpenAI only in January of this year. Little public information is available about the other two, Yufei Guo and Clarence Ng.
Also new to the job is Connor Holmes. While working at Microsoft, he contributed to the inference optimization of DALL·E 3 as an outside collaborator, and he later joined OpenAI.
Finally, take a look at the full list of authors:
Judging from how the team was formed and the research it builds on, Sora appears to be a product of OpenAI's work over the past six months, rather than something that "has existed for a long time but was held back," as some online reports claim.
Still, Sora's explosive popularity and OpenAI's continued ability to gather top talent have pushed everyone to re-examine the company's technological lead.
Just today, the authors released new Sora samples, including multi-camera videos of "the same scene."
Netizens’ mood be like:
Now, it’s video generation, what’s next?
Reference links: [1]https://www.wpeebles.com/ [2]https://www.timothybrooks.com/about/ [3]http://adityaramesh.com/about.html